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Preface 


T his book, like the first edition, presents the basic 
concepts and results of logic: the topics are proofs, 
truth, and computability. As before, the presentation 
is directed toward the reader with some mathematical back¬ 
ground and interests. In this revised edition, in addition to 
numerous “local” changes, there are three “global” ways in 
which the presentation has been changed: 

First, I have attempted to make the material more ac¬ 
cessible to the typical undergraduate student. In the main 
development, I have tried not to take for granted informa¬ 
tion or insights that might be unavailable to a junior-level 
mathematics student. 

Second, for the instructor who wants to fit the book to his 
or her course, the organization has been made mor e, flexible. 
Footnotes at the beginning of many of the sections indicate 
optional paths the instructor — or the independent reader — 
might choose to take. 

Third, theoretical computer science has influenced logic 
in recent years, and some of that influence is reflected in this 
edition. Issues of computability are taken more seriously. 
Some material on finite models has been incorporated into 
the text. 

The book is intended to serve as a textbook for an in¬ 
troductory mathematics course in logic at the junior-senior 
level. The objectives are to present the important concepts 
and theorems of logic and to explain their significance and 
their relationship to the reader’s other mathematical work. 

As a text, the book can be used in courses anywhere from 
a quarter to a year in length. In one quarter, I generally reach 
the material on models of first-order theories (Section 2.6). 
The extra time afforded by a semester would permit some 
glimpse of undecidability, as in Section 3.0. In a second 
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term, the material of Chapter 3 (on undecidability) can be more adequately cov¬ 
ered. 

The book is intended for the reader who has not studied logic previously, but who 
has some experience in mathematical reasoning. There are no specific prerequisites 
aside from a willingness to function at a certain level of abstraction and rigor. 
There is the inevitable use of basic set theory. Chapter 0 gives a concise summary 
of the set theory used. One should not begin the book by studying this chapter; it is 
instead intended for reference if and when the need arises. The instructor can adjust 
the amount of set theory employed; for example it is possible to avoid cardinal 
numbers completely (at the cost of losing some theorems). The book contains 
some examples drawn from abstract algebra. But they are just examples, and are 
not essential to the exposition. The later chapters (Chapter 3 and 4) tend to be more 
demanding of the reader than are the earlier chapters. 

Induction and recursion are given a more extensive discussion (in Section 1.4) 
than has been customary. I prefer to give an informal account of these subjects in 
lectures and have a precise version in the book rather than to have the situation 
reversed. 

Exercises are given at the end of nearly all the sections. If the exercise bears a 
boldface numeral, then the results of that exercise are used in the exposition in the 
text. Unusually challenging exercises are marked with an asterisk. 

I cheerfully acknowledge my debt to my teachers, a category in which I include 
also those who have been my colleagues or students. I would be pleased to receive 
comments and corrections from the users of this book. The Web site for the book 
can be found at http://www.math.ucla.edu/~hbe/amil. 
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S ymbolic logic is a mathematical model of deductive 
thought. Or at least that was true originally; as with 
other branches of mathematics it has grown beyond 
the circumstances of its birth. Symbolic logic is a model 
in much the same way that modern probability theory is a 
model for situations involving chance and uncertainty. 

How are models constructed? You begin with a real-life 
object, for example an airplane. Then you select some fea¬ 
tures of this original object to be represented in the model, 
for example its shape, and others to be ignored, for example 
its size. And then you build an object that is like the original 
in some ways (which you call essential) and unlike it in oth¬ 
ers (which you call irrelevant). Whether or not the resulting 
model meets its intended purpose will depend largely on 
the selection of the properties of the original object to be 
represented in the model. 

Logic is more abstract than airplanes. The real-life ob¬ 
jects are certain “logically correct” deductions. For example. 

All men are mortal. 

Socrates is a man. 

Therefore, Socrates is mortal. 

The validity of inferring the third sentence (the conclu¬ 
sion) from the first two (the assumptions) does not depend 
on special idiosyncrasies of Socrates. The inference is jus¬ 
tified by the form of the sentences rather than by empirical 
facts about mortality. It is not really important here what 
“mortal” means; it does matter what “all” means. 

Borogoves are mimsy whenever it is brillig. 

It is now brillig, and this thing is a borogove. 

Hence this thing is mimsy. 
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Again we can recognize that the third sentence follows from the first two, even 
without the slightest idea of what a mimsy borogove might look like. 

Logically correct deductions are of more interest than the above frivolous ex¬ 
amples might suggest. In fact, axiomatic mathematics consists of many such de¬ 
ductions laid end to end. These deductions made by the working mathematician 
constitute real-life originals whose features are to be mirrored in our model. 

The logical correctness of these deductions is due to their form but is indepen¬ 
dent of their content. This criterion is vague, but it is just this sort of vagueness that 
prompts us to turn to mathematical models. A major goal will be to give, within 
a model, a precise version of this criterion. The questions (about our model) we 
will initially be most concerned with are these: 

1. What does it mean for one sentence to “follow logically’’ from certain others? 

2. If a sentence does follow logically from certain others, what methods of 
proof might be necessary to establish this fact? 

3. Is there a gap between what we can prove in an axiomatic system (say for 
the natural numbers) and what is true about the natural numbers? 

4. What is the connection between logic and computability? 

Actually we will present two models. The first (sentential logic) will be very 
simple and will be woefully inadequate for interesting deductions. Its inadequacy 
stems from the fact that it preserves only some crude properties of real-life de¬ 
ductions. The second model (first-order logic) is admirably suited to deductions 
encountered in mathematics. When a working mathematician asserts that a par¬ 
ticular sentence follows from the axioms of set theory, he or she means that this 
deduction can be translated to one in our model. 

This emphasis on mathematics has guided the choice of topics to include. This 
book does not venture into many-valued logic, modal logic, or intuitionistic logic, 
which represent different selections of properties of real-life deductions. 

Thus far we have avoided giving away much information about what our model, 
first-order logic, is like. As brief hints, we now give some examples of the expres¬ 
siveness of its formal language. First, take the English sentence that asserts the 
set-theoretic principle of extensionality, “If the same things are members of a first 
object as are members of a second object, then those objects are the same.” This 
can be translated into our first-order language as 

Vx Vy( Vz(z e x z e y) —* x = y). 

As a second example, we can translate the sentence familiar to calculus students, 
“For every positive number e there is a positive number S such that for any x whose 
distance from a is less than S, the distance between fix) and b is less than e” as 

Ve(e > 0 —* 3 <5(5 > 0 A \/x(dxa < 8 —► dfxb < e))). 

We have given some hints as to what we intend to do in this book. We should 
also correct some possible misimpressions by saying what we are not going to do. 
This book does not propose to teach the reader how to think. The word “logic” is 
sometimes used to refer to remedial thinking, but not by us. The reader already 
knows how to think. Here are some intriguing concepts to think about. 
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Useful Facts 
about Sets 



W e assume that the reader already has 
some familiarity with normal everyday 
set-theoretic apparatus. Nonetheless, we 
give here a brief summary of facts from set theory 
we will need; this will at least serve to establish 
the notation. It is suggested that the reader, instead 
of poring over this chapter at the outset, simply 
refer to it if and when issues of a set-theoretic na¬ 
ture arise in later chapters. The author’s favorite 
book on set theory is of course his Elements of Set 
Theory (see the list of references at the end of this 
book). 

First a word about jargon. Throughout the book 
we will utilize an assortment of standard mathe¬ 
matical abbreviations. We use “H” to signify the 
end of a proof. A sentence “If ..., then ..will 
sometimes be abbreviated “... ....” We also 

have “<=” for the converse implication (for the pe¬ 
culiar way the word “implication” is used in math¬ 
ematics). For “if and only if” we use the shorter 
“iff” (this has become part of the mathematical lan¬ 
guage) and the symbol “<£>.” For the word “there¬ 
fore” we have the abbreviation. 

The notational device that extracts “x f y” as 
the denial of “x = y” and “x y” as the denial 
of “x e y” will be extended to other cases. For 
example, in Section 1.2 we define “E |= r”; then 
“E t” is its denial. 

Now then, a set is a collection of things, called 
its members or elements. As usual, we write “t e 
A” to say that / is a member of A, and “t f A” to 
say that t is not a member of A. We write “x = y” to 
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mean that x and y are the same object. That is, the expression “x” on the 
left of the equals sign is a name for the same object as is named by the 
other expression “y.” If A = B. then for any object t it is automatically 
true that t e A iff t e B. This holds simply because A and B are the 
same thing. The converse is the principle of extensionality: If A and B 
are sets such that for every object t , 

t e A iff t e B. 

then A — B. This reflects the idea of what a set is; a set is determined 
just by its members. 

A useful operation is that of adjoining one extra object to a set. For 
a set A, let A; t be the set whose members are (i) the members of A, 
plus (ii) the (possibly new) member t. Here t may or may not already 
belong to A, and we have 

A; t — A U {r} 

using notation defined later, and 

t e A iff A; t = A. 

One special set is the empty set 0, which has no members at all. Any 
other set is said to be nonempty. For any object x there is the singleton 
set {x} whose only member is x. More generally, for any finite number 
Xi,, x n of objects there is the set {jci, ..., x,,} whose members are 
exactly those objects. Observe that {x, y) — {y, x}, as both sets have 
exactly the same members. We have only used different expressions to 
denote the set. If order matters, we can use ordered pairs (discussed 
later). 

This notation will be stretched to cover some simple infinite cases. 
For example, {0, 1, 2,...} is the set N of natural numbers, and {..., —2, 
— 1,0, 1, 2,...} is the set Z of all integers. 

We write “{x \ _x_}” for the set of all objects x such that _x_. 
We will take considerable liberty with this notation. For example, 
{( m,n) | m < n in N} is the set of all ordered pairs of natural 
numbers for which the first component is smaller than the second. And 
[x e A |_ jc _} is the set of all elements x in A such that _x_. 

If A is a set all of whose members are also members of B, then A is a 
subset of B, abbreviated “A C 5.” Note that any set is a subset of itself. 
Also, 0 is a subset of every set. (“0 C A” is “vacuously true,” since 
the task of verifying, for every member of 0, that it also belongs to A 
requires doing nothing at all. Or from another point of view, “A C B” 
can be false only if some member of A fails to belong to B. If A = 0, 
this is impossible.) From the set A we can form a new set, the power set 
VA of A, whose members are the subsets of A. Thus 


VA = {x | x c A}. 
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For example, 

?0 = { 0 }, 

7>{0} = {0, {0}}. 

The union of A and B, A U B, is the set of all things that are members 
of A or B (or both). For example, A; t — A U {r}. Similarly, the inter¬ 
section of A and B, An B, is the set of all things that are members 
of both A and B. Sets A and B are disjoint iff their intersection is 
empty (i.e., if they have no members in common). A collection of sets 
is pairwise disjoint iff any two members of the collection are disjoint. 

More generally, consider a set A whose members are themselves sets. 
The union, (J A, of A is the set obtained by dumping all the members 
of A into a single set: 

A = [x | x belongs to some member of A}. 

Similarly for nonempty A, 

P'l A = {x | x belongs to all members of A}. 

For example, if 

A = {{0,1, 5}, {1,6}, {1,5}}, 

then 

\JA = {0, 1,5,6}, 

n^ = u}. 

Two other examples are 

AU B = UfA, B), 

U VA = A. 

In cases where we have a set A„ for each natural number n, the union 
of all these sets, [J{A„ | n e N}, is usually denoted “(J ngN A„” or just 

“U 

The ordered pair (x. y) of objects x and y must be defined in such a 
way that 

{x , y) — ( u , v) iff x = u and y — v. 

Any definition that has this property will do; the standard one is 

(*> y) = {M, {x, y}}. 

For ordered triples we define 


( x,y,z) = ({x,y),z). 
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More generally we define n -tuples recursively by 

(*i-,*«+i> = {{x u ...,x n ),x n+ i) 

for n > 1. It is convenient to define also (x ) = x; the preceding equation 
then holds also for n — 1.5 is a finite sequence (or string ) of members 
of A iff for some positive integer n, we have S — (x\,..., x„), where 
each Xj e A. (Finite sequences are often defined to be certain finite 
functions, but the above definition is slightly more convenient for us.) 

A segment of the finite sequence 5 = (xi ,..., x n ) is a finite sequence 

{ Xk , Xfc+i, • • ■, Xffi — i, x m ), where 1 < k < m < n. 

This segment is an initial segment iff k = 1 and it is proper iff it is 
different from S. 

If (jci, ..., x n ) = (yi, ..., y„), then it is easy to see that x; = y,- 
for !<;'<«. (The proof uses induction on n and the basic property 
of ordered pairs.) But if [x\,, x m ) — (vi, ..., y„). then it does not 
in general follow that m = n. After all, every ordered triple is also an 
ordered pair. But we claim that m and n can be unequal only if some x,- 
is itself a finite sequence of v/s, or the other way around: 

Lemma OA Assume that {x \,..., x m ) — (yi,..., y m ,..., y m +k)- 
Then x \ — (yi,-y*+i). 

Proof. We use induction on m. If m = 1, the conclusion is imme¬ 
diate. For the inductive step, assume that (xi,..., x m , x,„ + i) = 
(yi,..., y m +k, y m +i+fc}. Then the first components of this ordered 
pair must be equal: (xi, ..., x m ) — (yi, ..., y m +k )• Now apply 
the inductive hypothesis. H 

For example, suppose that A is a set such that no member of A is a 
finite sequence of other members. Then if (xi,..., x m ) — (yi,..., y„) 
and each x, and y , is in A, then by the above lemma m — n. Whereupon 
we have x,- = y,- as well. 

From sets A and B we can form their Cartesian product, the set 
A x B of all pairs (x, y) for which x e A and y e B. A" is the set of 
all n-tuples of members of A. For example, A 3 = (A x A) x A. 

A relation R is a set of ordered pairs. For example, the ordering 
relation on the numbers 0-3 is captured by — and in fact is — the set of 
ordered pairs 


{(0,1), (0, 2), (0,3), (1,2), (1,3), (2, 3)}. 

The domain of R (written dom R) is the set of all objects x such that 
(x, y) e R for some y. The range of R (written ran R) is the set of all 
objects y such that (x, y) e R for some x. The union of dom R and 
ran R is th e field of R, fid R. 
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An n-ary relation on A is a subset of A". If n > 1, it is a relation. But a 
1-ary (unary) relation on A is simply a subset of A. A particularly simple 
binary relation on A is the equality relation {{x, x) \ x e A} on A. For 
an n -ary relation R on A and subset B of A. the restriction of R to B 
is the intersection RH B". For example, the relation displayed above is 
the restriction to the set B = {0, 1, 2, 3} of the ordering relation on N. 

A function is a relation F with the property of being single-valued: 
For each x in dom F there is only one y such that (x, y) e F. As usual, 
this unique y is said to be the value Fix) that F assumes at x. (This 
notation goes back to Euler. It is a pity he did not choose (x)F instead; 
that would have been helpful for the composition of functions: / o g is 
the function whose value at x is f(g(x)), obtained by applying first g 
and then /.) 

We say that F maps A into B and write 
F : A -» B 

to mean that F is a function, dom F — A, and ran F C B. If in addition 
ran F — B, then F maps A onto B. F is one-to-one iff for each y in 
ran F there is only one x such that ( x , y) e F. If the pair (x , y) is in 
dom F, then we let F(x, y) — F((x, y)). This notation is extended to 
n-tuples; F(x i,..., x„) = F((x i,..., x„)). 

An n-ary operation on A is a function mapping A” into A. For exam¬ 
ple, addition is a binary operation on N, whereas the successor operation 
S (where S(n) — n + 1) is a unary operation on N. If / is an n -ary op¬ 
eration on A, then the restriction of / to a subset B of A is the function 
g with domain B" which agrees with / at each point of B". Thus, 

8 = f n (B n x A). 

This g will be an n-ary operation on B iff B is closed under /, in the 
sense that f{b\,... ,b n ) e B whenever each /;, is in B. In this case, 
g = fnB”+\ in agreement with our definition of the restriction of a 
relation. For example, the addition operation on N, which contains such 
triples as ((3, 2), 5), is the restriction to N of the addition operation on 
K, which contains many more triples. 

A particularly simple unary operation on A is the identity function 
Id on A, given by the equation 

Id{x) — x for x e A. 

Thus Id — {{x,x) i e A). 

For a relation R, we define the following: 

R is reflexive on A iff (x, x) e R for every x in A. 

R is symmetric iff whenever (x, y) e R, then also (y, x) e R. 

R is transitive iff whenever both (x, y) e R and (y,z) e R (if this 
ever happens), then also (x, z) e R. 
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R satisfies trichotomy on A iff for every x and y in A. exactly one of 
the three possibilities, (x, y) e R. x = y, or (y, x) e R , holds. 

R is an equivalence relation on A iff R is a binary relation on A that 
is reflexive on A, symmetric, and transitive. 

R is an ordering relation on A iff R is transitive and satisfies tri¬ 
chotomy on A. 

For an equivalence relation If on A we define, for x e A, the equiv¬ 
alence class [x] of x to be {y \ (x, y) e R}. The equivalence classes 
then partition A. That is, the equivalence classes are subsets of A such 
that each member of A belongs to exactly one equivalence class. For x 
and y in A, 

M = [y] iff {x, y) e R. 

The set N of natural numbers is the set {0, 1,2,...}. (Natural num¬ 
bers can also be defined set-theoretically, a point that arises briefly in 
Section 3.7.) A set A is finite iff there is some one-to-one function / 
mapping (for some natural number n) the set A onto {0, l,... ,n — 1}. 
(We can think of / as “counting” the members of A.) 

A set A is countable iff there is some function mapping A one-to-one 
into N. For example, any finite set is obviously countable. Now consider 
an infinite countable set A. Then from the given function / mapping 
A one-to-one into N, we can extract a function /' mapping A one-to- 
one onto N. For some ao € A, /(ao) is the least member of ran/, let 
/'(ao) = 0. In general there is a unique a„ e A such that f(a n ) is the 
(n + l)st member of ran /; let f(a n ) = n. Note that A = {ao, a i, ...}. 
(We can also think of /' as “counting” the members of A, only now the 
counting process is infinite.) 

Theorem OB Let A be a countable set. Then the set of all finite 
sequences of members of A is also countable. 

Proof. The set S of all such finite sequences can be characterized 
by the equation 


S = (J A" +1 . 

ne N 

Since A is countable, we have a function / mapping A one- 
to-one into N. 

The basic idea is to map S one-to-one into N by assigning 
to (ao. a\,..., a m ) the number 2^ (flo)+1 3^ (ai)+1 • ... • 
where p m is the (m + l)st prime. This suffers from the defect 
that this assignment might not be well-defined. For conceivably 
there could be (ao. a\,..., a m ) = (bo. b\,..., b„), with a, and 
bj in A but with m n. But this is not serious; just assign to 
each member of S the smallest number obtainable in the above 
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fashion. This gives us a well-defined map; it is easy to see that it 
is one-to-one. H 

At times we will speak of trees, which can be useful in providing 
intuitive pictures of some situations. But our comments on trees will 
always be informal; the theorems and proofs will not rely on trees. 
Accordingly, our discussion here of trees will be informal. 

For each tree there is an underlying finite partial ordering. We can 
draw a picture of this partial ordering R\ if (a, b) e R, then we put a 
lower than b and connect the points by a line. Pictures of two typical 
tree orderings are shown. 



• • 


(In mathematics, trees grow downward, not upward.) There is always a 
highest point in the picture (the root). Furthermore, while branching is 
permitted below some vertex, the points above any given vertex must 
lie along a line. 

In addition to this underlying finite partial ordering, a tree also has a 
labeling function whose domain is the set of vertices. For example, one 
tree, in which the labels are natural numbers, is shown. 

4 





At a few points in the book we will use the axiom of choice. But 
usually these uses can be eliminated if the theorems in question are 
restricted to countable languages. Of the many equivalent statements of 
the axiom of choice, Zorn’s lemma is especially useful. 

Say that a collection C of sets is a chain iff for any elements x and 
y of C, either x C. y or y C. x. 

Zorn's Lemma Let A be a set such that for any chain CCA, the set 
(J C is in A. Then there is some element m e A which is maximal 
in the sense that it is not a subset of any other element of A. 
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Cardinal Numbers 

All infinite sets are big, but some are bigger than others. (For example, 
the set of real numbers is bigger than the set of integers.) Cardinal num¬ 
bers provide a convenient, although not indispensable, way of talking 
about the size of sets. 

It is natural to say that two sets A and B have the same size iff there is 
a function that maps A one-to-one onto B. If A and B are finite, then this 
concept is equivalent to the usual one: If you count the members of A 
and the members of B, then you get the same number both times. But it 
is applicable even to infinite sets A and B, where counting is difficult. 

Formally, then, say that A and B are equinumerous (written A ~ B) 
iff there is a one-to-one function mapping A onto B. For example, the 
set N of natural numbers and the set Z of integers are equinumerous. It 
is easy to see that equinumerosity is reflexive, symmetric, and transitive. 

For finite sets we can use natural numbers as measures of size. The 
same natural number would be assigned to two finite sets (as measures 
of their size) iff the sets were equinumerous. Cardinal numbers are 
introduced to enable us to generalize this situation to infinite sets. 

To each set A we can assign a certain object, the cardinal number 
(or cardinality ) of A (written card A), in such a way that two sets are 
assigned the same cardinality iff they are equinumerous: 

card A = card B iff A ~ B. (K) 

There are several ways of accomplishing this; the standard one these 
days takes card A to be the least ordinal equinumerous with A. (The 
success of this definition relies on the axiom of choice.) We will not 
discuss ordinals here, since for our purposes it matters very little what 
card A actually is, any more than it matters what the number 2 actually 
is. What matters most is that (K) holds. It is helpful, however, if for a 
finite set A, card A is the natural number telling how many elements A 
has. Something is a cardinal number, or simply a cardinal, iff it is card A 
for some set A. 

(Georg Cantor, who first introduced the concept of cardinal number, 
characterized in 1895 the cardinal number of a set M as “the general 
concept which, with the help of our active intelligence, comes from the 
set M upon abstraction from the nature of its various elements and from 
the order of their being given.”) 

Say that A is dominated by B (written A < B) iff A is equinumerous 
with a subset of B. In other words, A < B iff there is a one-to-one 
function mapping A into B. The companion concept for cardinals is 

cardA<cardZ? iff A<B. 

(It is easy to see that < is well defined; that is, whether or not k < X 
depends only on the cardinals k and X themselves, and not the choice of 
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sets having these cardinalities.) Dominance is reflexive and transitive. A 
set A is dominated by N iff A is countable. The following is a standard 
result in this subject. 

Schroder-Bernstein Theorem (a) For any sets A and B, if A < B 
and B < A, then A ~ B. 

(b) For any cardinal numbers k and X, if k < X and X < k, then 
k — X. 

Part (b) is a simple restatement of part (a) in terms of cardinal num¬ 
bers. The following theorem, which happens to be equivalent to the 
axiom of choice, is stated in the same dual manner. 

Theorem 0C (a) For any sets A and B, either A < B or B < A. 

(b) For any cardinal numbers k and X, either k < X or X < k . 

Thus of any two cardinals, one is smaller than the other. (In fact, any 
nonempty set of cardinal numbers contains a smallest member.) The 
smallest cardinals are those of finite sets: 0, 1,2,.... There is next the 
smallest infinite cardinal, cardN, which is given the name No. Thus we 
have 


0, 1,2. N 0 ,Ni,..., 

where N| is the smallest cardinal larger than No. The cardinality of the 
real numbers, cardR, is called “2^°.” Since R is uncountable, we have 
N 0 < 2*°. 

The operations of addition and multiplication, long familiar for finite 
cardinals, can be extended to all cardinals. To compute k + X we choose 
disjoint sets A and B of cardinality k and X, respectively. Then 

k + X — card(A U B). 

This is well defined; i.e., k + X depends only on k and X, and not on the 
choice of the disjoint sets A and B. For multiplication we use 

k ■ X = card(A x B). 

Clearly these definitions are correct for finite cardinals. The arithmetic 
of infinite cardinals is surprisingly simple (with the axiom of choice). 
The sum or product of two infinite cardinals is simply the larger of them: 

Cardinal Arithmetic Theorem For cardinal numbers k and X, if k < 
X and X is infinite, then k + X = X. Furthermore, if k ^ 0, then 
k ■ X — X. 


In particular, for infinite cardinals k. 
No ■ K — K. 
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Theorem 0D For an infinite set A, the set |J H A" +1 of all finite se¬ 
quences of elements of A has cardinality equal to card A. 

We already proved this for the case of a countable A (see Theo¬ 
rem OB). 

Proof. Each A" +1 has cardinality equal to card A, by the cardinal 
arithmetic theorem (applied n times). So we have the union of K (l 
sets of this size, yielding Kq • card A = card A points altogether. 

H 

Example. It follows that the set of algebraic numbers has cardinal¬ 
ity K(). First, we can identify each polynomial (in one variable) 
over the integers with the sequence of its coefficients. Then by 
the theorem there are K (l polynomials. Each polynomial has a fi¬ 
nite number of roots. To give an extravagant upper bound, note 
that even if each polynomial had K 0 roots, we would then have 
Ko • Ko = Ko algebraic numbers altogether. Since there are at least 
this many, we are done. 

Since there are uncountably many (in fact, 2 K °) real numbers, it 
follows that there are uncountably many (in fact, 2 K ° J transcendental 
numbers. 
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Sentential Logic 


SECTION 1.0 
Informal Remarks on 
Formal Languages 



We are about to construct (in the next section) a 
language into which we can translate English sen¬ 
tences. Unlike natural languages (such as English 
or Chinese), it will be a formal language, with pre¬ 
cise formation rules. But before the precision be¬ 
gins, we will consider here some of the features to 
be incorporated into the language. 

As a first example, the English sentence “Traces 
of potassium were observed” can be translated into 
the formal language as, say, the symbol K. Then for 
the closely related sentence “Traces of potassium 
were not observed,” we can use (—i K). Here -i is 
our negation symbol, read as “not.” One might also 
think of translating “Traces of potassium were not 
observed” by some new symbol, for example, J, 
but we will prefer instead to break such a sentence 
down into atomic parts as much as possible. For an 
unrelated sentence, “The sample contained chlo¬ 
rine,” we choose, say, the symbol C. Then the fol¬ 
lowing compound English sentences can be trans¬ 
lated as the formulas shown at the right: 

If traces of potassium were ob- (K —> (-1 C)) 

served, then the sample did not 
contain chlorine. 

The sample contained chlorine, (C A K) 

and traces of potassium were ob¬ 
served. 
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The second case uses our conjunction symbol A to translate “and.” 
The first one uses the more familiar arrow to translate “if.... then ... 

In the following example the disjunction symbol V is used to translate 
“or”: 

Either no traces of potassium were observed, or ((— i K) V (-1 C)) 

the sample did not contain chlorine. 

Neither did the sample contain (-1 (C V K)) 

chlorine, nor were traces of or 

potassium observed. ((—1 C) A (—1 K)) 

For this last sentence, we have two alternative translations. The re¬ 
lationship between them will be discussed later. 

One important aspect of the decompositions we will make of com¬ 
pound sentences is that whenever we are given the truth or falsity of the 
atomic parts, we can then immediately calculate the truth or falsity of 
the compound. Suppose, for example, that the chemist emerges from 
her laboratory and announces that she observed traces of potassium but 
that the sample contained no chlorine. We then know that the four above 
sentences are true, false, true, and false, respectively. In fact, we can con¬ 
struct in advance a table giving the four possible experimental results 
(Table I). We will return to the discussion of such tables in Section 1.2. 


TABLE I 


K 

C 

(—'(C V K)) 

((- C) A (-1 K)) 

F 

F 

T 

T 

F 

T 

F 

F 

T 

F 

F 

F 

T 

T 

F 

F 


Use of formal languages will allow us to escape from the imprecision 
and ambiguities of natural languages. But this is not done without cost; 
our formal languages will have a sharply limited degree of expressive¬ 
ness. 

In order to describe a formal language we will generally give three 
pieces of information: 

1. We will specify the set of symbols (the alphabet). In the present 
case of sentential logic some of the symbols are 

A!, A 2 ,.... 

2. We will specify the rules for forming the “grammatically correct” 
finite sequences of symbols. (Such sequences will be called well-formed 
formulas or wjfs.) For example, in the present case 

(Aj —y (—. A 2 » 
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will be a wff, whereas 

)) —A 3 

will not. 

3. We will also indicate the allowable translations between English 
and the formal language. The symbols Ai, A 2 ,... can be translations 
of declarative English sentences. 

Only in this third part is any meaning assigned to the wffs. This 
process of assigning meaning guides and motivates all we do. But it 
will also be observed that it would theoretically be possible to carry out 
various manipulations with wffs in complete ignorance of any possible 
meaning. A person aware of only the first two pieces of information 
listed above could perform some of the things we will do, but it would 
make no sense to him. 

Before proceeding, let us look briefly at another class of formal 
languages of widespread interest today. These are the languages used 
by (or at least in connection with) digital computers. 

There are many of these languages. In one of them a typical wff is 

011010110101000111110001000001111010 . 

In another a typical wff is 

STEP#ADDIMAX, A. 

(Here # is a symbol called a blank; it is brought into the alphabet so that 
a wff will be a string of symbols.) A well-known language called C++ 
has wffs such as 


while(*s++); 

In all cases there is a given way of translating the wffs into English, 
and (for a restricted class of English sentences) a way of translating 
from English into the formal language. But the computer is unaware of 
the English language. An unthinking automaton, the computer juggles 
symbols and follows its program slavishly. We could approach formal 
languages that way, too, but it would not be much fun. 


SECTION 1.1 _ 

The Language of Sentential Logic 

We assume we are given an infinite sequence of distinct objects which 
we will call symbols, and to which we now give names (Table II). We 
further assume that no one of these symbols is a finite sequence of other 
symbols. 



14 


A Mathematical Introduction to Logic 


TABLE II 


Symbol 

Verbose name 

Remarks 

( 

left parenthesis 

punctuation 

) 

right parenthesis 

punctuation 

— 1 

negation symbol 

English: not 

A 

conjunction symbol 

English: and 

V 

disjunction symbol 

English: or (inclusive) 

—► 

conditional symbol 

English: if_, then_ 


biconditional symbol 

English: if and only if 

Ai 

first sentence symbol 


a 2 

second sentence symbol 


A„ 

77 th sentence symbol 



Several remarks are now in order: 

1. The five symbols 


-i, A, V, —<-► 

are called sentential connective symbols. Their use is suggested by the 
English translation given above. The sentential connective symbols, 
together with the parentheses, are the logical symbols. In translating to 
and from English, their role never changes. The sentence symbols are 
the parameters (or nonlogical symbols). Their translation is not fixed; 
instead they will be open to a variety of interpretations, as we shall 
illustrate shortly. 

2. We have included infinitely many sentence symbols. On the one 
hand, a more modest alternative would be to have one sentence symbol 
A, and a prime '. Then we could use the potentially infinite sequence 

A, A', A",... 

in place of 

Ai, A 2 , A 3 .... 

This alternative has the advantage that it brings the total number of 
different symbols down to nine. On the other hand, a less modest al¬ 
ternative would be to allow an arbitrary set of set sentence symbols, 
countable or not. Much of what is said in this chapter would con¬ 
tinue to be applicable in this case; the exceptions are primarily in Sec¬ 
tion 1.7. 

3. Some logicians prefer to call A„ the nth proposition symbol (and 
to speak of propositional logic instead of sentential logic). This stems 
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from wanting the word “sentence” to refer to a particular utterance, and 
wanting a proposition to be that which a sentence asserts. 

4. We call these objects “symbols,” but we remain neutral as to what 
the exact ontological status of symbols might be. In the leftmost column 
of our list of symbols, names of the symbols are printed. For example, 
A 243 is one symbol, namely the 243rd sentence symbol. (On the other 
hand, “A 243 ” is a name of that symbol. The conditional symbol may 
or may not have the geometric property of being shaped like an arrow, 
although its name “—»” does.) The symbols themselves can be sets, 
numbers, marbles, or objects from a universe of linguistic objects. In 
the last case, it is conceivable that they are actually the same things as 
the names we use for them. Another possibility, which will be explained 
in the next chapter, is that the sentence symbols are themselves formulas 
in another language. 

5. We have assumed that no symbol is a finite sequence of other 
symbols. We mean by this that not only are the symbols listed distinct 
(e.g., A 3 ^ —), but no one of them is a finite sequence of two or more 
symbols. For example, we demand that A 3 ^ (- 1 , A 4 , (}. The purpose of 
this assumption is to assure that finite sequences of symbols are uniquely 
decomposable. If 

{a \,..., a m ) = (hi,..., b n ) 

and each a, and each bj is a symbol, then m = n and < 3 / = /;,. (See 
Chapter 0, Lemma 0A, and subsequent remarks.) 

An expression is a finite sequence of symbols. We can specify an 
expression by concatenating the names of the symbols; thus (—1 Ai) is 
the sequence ((, - 1 , Aj,)}. This notation is extended: If a and ft are 
sequences of symbols, then aj5 is the sequence consisting first of the 
symbols in the sequence a followed by the symbols in the sequence fi. 

For example, if a and ft are the expressions given by the equations 

a = (—■ Ai), 

ft — a 2 , 

then (a —► ft) is the expression 

((-.AO —A 2 ). 

We should now look at a few examples of possible translations of 
English sentences into expressions of the formal language. Let A, B, 
..., Z be the first twenty-six sentence symbols. (For example, E = A 5 .) 

1. English: The suspect must be released from custody. Transla¬ 
tion: R. 

English: The evidence obtained is admissible. Translation: E. 

English: The evidence obtained is inadmissible. Translation: (— 1 E). 
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English: The evidence obtained is admissible, and the suspect need 
not be released from custody. Translation: (E A (—i R)). 

English: Either the evidence obtained is admissible, or the suspect 
must be released from custody (or possibly both). Translation: (E V R). 

English: Either the evidence obtained is admissible, or the suspect 
must be released from custody, but not both. Translation: ((E V R) A 
(- (E A R))). We intend always to use the symbol V to translate the 
word “or” in its inclusive meaning of “and/or.” 

English: The evidence obtained is inadmissible, but the suspect need 
not be released from custody. Translation: ((— i E) A (-> R)). On the other 
hand, the expression ((— i E) V (—1R)) translates the English: Either the 
evidence obtained is inadmissible or the suspect need not be released 
from custody. 

2. English: If wishes are horses, then beggars will ride. Translation: 
(W -» B). 

English: Beggars will ride if and only if wishes are horses. Transla¬ 
tion: (B W). 

3. English: This commodity constitutes wealth if and only if it is 
transferable, limited in supply, and either productive of pleasure or pre¬ 
ventive of pain. Translation: (W •*-» (T A (L A (P V Q)))). Here W is 
the translation of “This commodity constitutes wealth.” Of course in the 
preceding example we used W to translate a different sentence. We are 
not tied to any one translation. 

One note of caution: Do not confuse a sentence in the English lan¬ 
guage (Roses are red) with a translation of that sentence in the formal 
language (such as R). These are different. The English sentence is pre¬ 
sumably either true or false. But the formal expression is simply a se¬ 
quence of symbols. It may indeed be interpreted in one context as a true 
(or false) English sentence, but it can have other interpretations in other 
contexts. 

Now some expressions cannot be obtained as translations of any 
English sentences but are mere nonsense, such as 

((—A 3 . 

We want to define the well-formed formulas (wffs) to be the “gram¬ 
matically correct” expressions; the nonsensical expressions must be 
excluded. The definition will have the following consequences: 

(a) Every sentence symbol is a wff. 

(b) If a and /8 are wffs, then so are (-1 a), (a A /8), (a V /3), (a —»• /J), 
and (a <-► /3). 

(c) No expression is a wff unless it is compelled to be one by (a) 
and (b). 
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We want to make this third property (about compulsion) more pre¬ 
cise. A well-formed formula (or simply formula or wff) is an expres¬ 
sion that can be built up from the sentence symbols by applying some 
finite number of times the formula-building operations (on expressions) 
defined by the equations 

£-,(&) = (-'«) 

£a (a, P) = (aA P), 

£ v (a, P) — (a V P), 

£->(&, P) — (a —* P), 
f M (a, P) = (a •*-»■ P). 

For example, 

((A! A A 10 ) — ((-, A 3 ) V (A„ <- A 3 ») 
is a wff, as can be seen by contemplating its ancestral tree: 


((Ax A A 10 ) - ((-.A3) V (Ag ~ A*))) 



The tree illustrates the construction of the expression from four sen¬ 
tence symbols by five applications of formula-building operations. This 
example is atypical in that it uses all five formula-building operations. 
For a smaller example, A 3 is a wff; its ancestral tree has only a single 
vertex; each formula-building operation is used zero times. But this is 
as small as examples get; we do not count the empty sequence as being 
“built up from the sentence symbols.” 

This sort of construction, of taking some basic building blocks (here 
the sentence symbols) and “closing” under some operations (here five 
operations), occurs frequently both in logic and in other branches of 
mathematics. In Section 1.4, we will examine this sort of construction 
in a more general setting. 

We can elaborate on the “building” idea as follows. Define a con¬ 
struction sequence to be a finite sequence (gj,..., e„) of expressions 
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such that for each i < n we have at least one of 
Si is a sentence symbol 
Si — S^iSj) for some j < i 
Sj = £□ (Sj, Sk) for some j < i, k < i 

where □ is one of the binary connectives, A, V, —>, •*-»■. Then the well- 
formed formulas can be characterized as the expressions a such that 
some construction sequence ends with a. We can think of e,- as the 
expression at stage i in the building process. 

For our earlier example, 

((A : A A 10 ) -> ((-. A 3 ) V (A s <- A 3 ))) 

we obtain a construction sequence by squashing its ancestral tree into a 
linear ordering. 

One feature of this sort of construction is that it yields an induction 
principle. Say that a set S is closed under a two-place function / iff 
whenever x e Sandy e S then fix, y) e S, and similarly for one-place 
functions and so forth. 

Induction Principle If S is a set of wffs containing all the sentence 
symbols and closed under all five formula-building operations, 
then S is the set of all wffs. 

First Proof. Consider anarbitrarywffa.lt is built up from sentence 
symbols by applying some finite number of times the formula¬ 
building operations. Working our way up the corresponding an¬ 
cestral tree, we find that each expression in the tree belongs to S. 
Eventually (that is, after a finite number of steps) at the top of the 
tree we find that a e S. H 

Second Proof. We will repeat the argument, but without the trees. 
Consider an arbitrary wff a. It is the last member of some con¬ 
struction sequence (si,..., s„). By ordinary strong numerical in¬ 
duction on the number i, we see that each £; e S, i < n. 

That is, we suppose, as our inductive hypothesis, that Sj e S 
for all j < i. We then verify that e, e S, by considering the 
various cases. So by strong induction on i, it follows that £,■ e S 
for each i < n. In particular, the last member a belongs to S. 

This principle will receive much use in the coming pages. In the 
following example we use it to show that certain expressions are not 
wffs. 

Example. Any expression with more left parentheses than right 
parentheses is not a wff. 
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Proof. The idea is that we start with sentence symbols (having 
zero left parentheses and zero right parentheses), and then ap¬ 
ply formula-building operations which add parentheses only in 
matched pairs. We can rephrase this argument as follows: The 
set of “balanced” wffs (having equal numbers of left and right 
parentheses) contains all sentence symbols and is closed under 
the formula-building operations. The induction principle then as¬ 
sures us that all wffs are balanced. H 

A special feature of our particular formula-building operations is that 
they build up and never down. That is, the expression £□(«, fi) always 
includes as a segment the entire sequence a (and the entire sequence f J >) 
plus other symbols. In particular, it is longer than either a or /3. 

This special feature will simplify the problem of determining, given 
a wff (p, exactly how it was built up. All the building blocks, so to speak, 
are included as segments in the sequence q>. For example, if < p does not 
contain the symbol A 4 , then tp can be built up without ever using A 4 . 
(See Exercise 4.) 


Exercises 

1. Give three sentences in English together with translations into our 
formal language. The sentences should be chosen so as to have an 
interesting structure, and the translations should each contain 15 or 
more symbols. 

2. Show that there are no wffs of length 2, 3, or 6, but that any other 
positive length is possible. 

3. Let a be a wff; let c be the number of places at which binary con¬ 
nective symbols (A, V, —>, <—>) occur in a; let s be the number of 
places at which sentence symbols occur in a. (For example, if a is 
(A —(—1 A)) then c = 1 and .v = 2.) Show by using the induction 
principle that s = c + 1. 

4. Assume we have a construction sequence ending in <p, where (p does 
not contain the symbol A 4 . Suppose we delete all the expressions 
in the construction sequence that contain A 4 . Show that the result is 
still a legal construction sequence. 

5. Suppose that a is a wff not containing the negation symbol -i. 

(a) Show that the length of a (i.e., the number of symbols in the 
string) is odd. 

(b) Show that more than a quarter of the symbols are sentence 
symbols. 

Suggestion: Apply induction to show that the length is of the form 
4-k + 1 and the number of sentence symbols is k + 1. 
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SECTION 1.2 _ 

Truth Assignments 

We want to define what it means for one wff of our language to fol¬ 
low logically from other wffs. For example, A[ should follow from 
(Ai A A 2 ). For no matter how the parameters Ai and A 2 are translated 
back into English, if the translation of (Ai A A 2 ) is true, then the transla¬ 
tion of Ai must be true. But the notion of all possible translations back 
into English is unworkably vague. Luckily the spirit of this notion can 
be expressed in a simple and precise way. 

Fix once and for all a set {F, T] of truth values consisting of two 
distinct points: 

F, called falsity, 

T, called truth. 

(It makes no difference what these points themselves are; they might as 
well be the numbers 0 and 1.) Then a truth assignment v for a set S of 
sentence symbols is a function 

v : S-> {F, Tj 

assigning either T or F to each symbol in S. These truth assignments 
will be used in place of the translations into English mentioned in the 
preceding paragraph. 

(At this point we have committed ourselves to two-valued, logic. It is 
also possible to study three-valued logic, in which case one has a set of 
three possible truth values. And then, of course, it is a small additional 
step to allow 512 or Kq truth values; or to take as the set of truth values 
the unit interval [0, 1 ] or some other convenient space, A particularly 
interesting case is that for which the truth values form something called 
a complete Boolean algebra. But it is two-valued logic that has always 
had the greatest significance, and we will confine ourselves to this case.) 

Let S be the set of wffs that can be built up from S by the five 
formula-building operations. (S can also be characterized as the set of 
wffs whose sentence symbols are all in S\ see the remarks at the end of 
the preceding section.) We want an extension v of v, 

v:S-+ {F, T}, 

that assigns the correct truth value to each wff in S. It should meet the 
following conditions: 

0. For any A e S, v(A ) = v(A). (Thus iJ is an extension of i>.) 

For any a, in S: 


1. v ((—1 or)) = 


T if v(a ) = F, 
F otherwise. 
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2. v((a A /))) = | p 

3. v((a V/»)={£ 

4. T7((a — J8)) = | ^ 

5. F((« <- £)) = | T p 

Conditions 1-5 are given in tabular form in Table III. It is at this point 
that the intended meaning of, for example, the conjunction symbol enters 
into our formal proceedings. Note especially the intended meaning of 
—k Whenever a is assigned F, then (a —»• fi) is considered “vacuously 
true” and is assigned the value T. For this and the other connectives, it 
is certainly possible to question how accurately we have reflected the 
common meaning in everyday speech of “if ..., then,” “or,” etc. But 
our ultimate concern lies more with mathematical statements than with 
the subtle nuances of everyday speech. 


TABLE 111 


a. 

0 

(-><*) 

(a A (3) 

(a V (3) 

(a —> f3) 

(a <-*• (3) 

T 

T 

F 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

T 

T 

F 

T 

T 

F 

F 

F 

T 

F 

F 

T 

T 


if via) — T and v(j3) — T, 
otherwise. 

if v(a) — T or vifi) — T (or both), 
otherwise. 

if v(a) = T and vifi) — F, 
otherwise. 

if u(a) = 
otherwise. 


For example, we might translate the English sentence, “If you’re 
telling the truth then I’m a monkey’s uncle,” by the formula (Y -> M). 
We assign this formula the value T whenever you are fibbing. In as¬ 
signing the value T, we are certainly not asserting any causal connec¬ 
tion between your veracity and any simian features of my nephews or 
nieces. The sentence in question is a conditional statement. It makes an 
assertion about my relatives provided a certain condition — that you are 
telling the truth — is met. Whenever that condition fails, the statement 
is vacuously true. 

Very roughly, we can think of a conditional formula (a —► fi) as 
expressing a promise that if a certain condition is met (viz., that a is 
true), then fi is true. If the condition a turns out not to be met, then the 
promise stands unbroken, regardless of fi. 

As an example of the calculation of v, let a be the wff 

((A 2 — (A, — Afi)) <- ((A 2 A A l ) — A 6 )) 
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and let v be the truth assignment for {Aj, A 2 , A () | such that 

w(AO = T, 
v(A 2 ) = T, 
v(A 6 ) = F. 

We want to compute v(a). We can look at the tree which displays the 
construction of a: 


((A, —► (Aj A e )) 



A 2 (A x —► A 6 ) 

T F 

/ \ 


Ai A e 

T F 


<-* ((A 2 a Aj) -► A 6 )) 

T 



(A« a Aj) A 6 

T F 

/ \ 


A 2 A x 
T T 


Working from the bottom up, we can assign to each vertex f J > of the tree 
the value u(/3). So as a first step we compute 

t>((Ai —> A 6 )) = F and rJ((A 2 A AO) = T. 

Next we compute vdAo —> (Ai —*• Ag))) = F, and so forth. Finally, at 
the top of the tree we arrive at v(a) — T. 

Actually this computation can be carried out with much less writing. 
First, the tree can be given in starker form: 



And even this can be compressed into a single line (with the parentheses 
restored): 
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((A 2 -► (Aj -► A 6 )) ((A* a AJ A.)). 

T F T F F T T T T F F 

Theorem 12A For any truth assignment v for a set S there is a 
unique function V : S -> { F, T) meeting the aforementioned 
conditions 0-5. 

The full proof of this theorem will emerge in the next two sections (1.3 
and 1.4). But it should already seem extremely plausible, especially in 
light of the preceding example. In proving the existence of v, the crucial 
issue will, in effect, be the uniqueness of the trees mentioned in the 
example. 

We say that a truth assignment v satisfies tp iff v(cp) = T. (Of course 
for this to happen, the domain of v must contain all sentence symbols in 
<p.) Now consider a set E of wffs (thought of as hypotheses) and another 
wff r (thought of as a possible conclusion). 

Definition. E tautologically implies r (written E |= r ) iff every 
truth assignment for the sentence symbols in E and r that satisfies 
every member of E also satisfies r. 

This definition reflects our intuitive feeling that a conclusion follows 
from a set of hypotheses if the assumption that the hypotheses are true 
guarantees that the conclusion is true. 

Several special cases of the concept of tautological implication de¬ 
serve mention. First, take the special case in which E is the empty set 0. 
Observe that it is vacuously true that any truth assignment satisfies every 
member of 0 . (Flow could this fail? Only if there was some unsatisfied 
member of 0 , which is absurd.) Hence we are left with: 0 |= r iff every 
truth assignment (for the sentence symbols in r) satisfies r. In this case 
we say that r is a tautology (written |= r). For example, the wff ((A 2 —► 
(Ai —> Ag)) ((A 2 A Ai) —> Ag)) was in a recent example found to be 
satisfied by one of the eight possible truth assignments for {Ai, A 2 , Ag}. 
In fact, it is satisfied by the other seven as well, and hence is a tautology. 

Another special case is that in which no truth assignment satisfies 
every member of E. Then for any r, it is vacuously true that E |= r. 
For example, 

{A, (-1 A)} |= B. 

There is no deep principle involved here; it is simply a by-product of 
our definitions. 

Example. {A, (A —► B)} |= B. There are four truth assignments for 
{A, B}. It is easy to check that only one of these four satisfies both 
A and (A — ► B), namely the v for which v(A) — c(B) = T. This 
v also satisfies B. 
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If E is singleton {cr}, then we write “cr |= r” in place of “{cr} |= r.”If 
both cr \= x and r |= cr, then a and r are said to be tautologically equiv¬ 
alent (written a |==| r). For example, in Section 1.0 we encountered 
the wffs (-1 (C V K)) and ((—i C) A (—i K)) as alternative translations of an 
English sentence. We can now assert that they are tautologically equiv¬ 
alent. 

We can already state here a nontrivial fact whose proof will be given 
later (in Section 1.7). 

Compactness Theorem Let E be an infinite set of wffs such that for 
any finite subset Eo of E, there is a truth assignment that satisfies 
every member of Eo. Then there is a truth assignment that satisfies 
every member of E. 

This theorem can be restated more simply as: If every finite subset of 
E is satishable, then E itself is satishable. (Readers familiar with some 
general topology might try to discover why this is called “compactness 
theorem”; it does assert the compactness of a certain topological space. 
They can then prove the theorem for themselves, by using Tychonoff’s 
theorem on product spaces.) 

Truth Tables 

There is a systematic procedure, which we will now illustrate, for check¬ 
ing, given wffs o \,..., Ok, and r, whether or not 

{ai, ..., o k ] |= r. 

In particular (when k = 0), the procedure will, given a wff, decide 
whether or not it is a tautology. 

As a first example, we can show that 

(-i (A A B)) |= ((-i A) V (-i B)). 

To do this, we consider all truth assignments for {A, B}. There are four 
such assignments; in general there are 2" truth assignments for a set of 
n sentence symbols. The four can be listed in a table: 

A B 

T T 

T F 

F T 

F F 

This table can then be expanded to include (-i (A A B)) and ((-i A) V 
(—iB)).For each formula we compute the 7”s and F’s the way described 
before, writing the truth value under the correct connective (Table IV). 
(The two leftmost columns of Table IV are actually unnecessary.) We 
can now see from this table that all those truth assignments satisfying 
(-1 (A A B)) — and there are three such — also satisfy ((-i A) V (—i B)). 
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TABLE IV 


A 

B 

(-(A A B)) 

((—'A) V (—iB)) 

T 

T 

FT T T 

FT F FT 

T 

F 

TT F F 

FTTT F 

F 

T 

T F FT 

T F T F T 

F 

F 

T F F F 

T FTT F 


In fact, the converse holds also, and thus 

(-1 (A A B)) |==| ((“ 1 A) V (—i B)). 

To show that (-i (A A B)) ^ ((-> A) A (-> B)) we can construct the 
table as before. But only one line of the table is needed to establish that 
there is indeed a truth assignment satisfying (-i (A A B)) that fails to 
satisfy ((-■ A) A (-i B)). 

The more generally applicable a procedure it is, the less efficient it 
is likely to be. For example, to show that 

|= ((A V (B A C)) <-► ((A V B) A (A V C))), 

we could apply the truth-table method. But this requires eight lines 
(for the eight possible truth assignments for {A, B, C}). With a little 
cleverness, the tedium in this particular case can be reduced: 


((A v (B a C)) 
T T 

F F F F 
F T T T T 


<->((A v B) a (A v C))). 

T F F F F F 
T F T T T F T T 


In the first line we assumed only that i>(A) = T. Since that is enough 
information to obtain T for the wff, we assume in all later lines that 
v(A) = F. In the second line we assume that v (B) = F; this again lets 
us obtain T for the wff. So we may limit ourselves to the case u(B) = T. 
Since the expression is symmetric in B and C, we may further suppose 
u(C) = T. This leaves only the third line, whereupon we are done. 

As an example of the avoidance of a 16-line table, consider the 
following tautology: 

((((P a Q) -* R) -* S) - ((P -v R) - S)). 

T T T 

F T F F T 
TTTRRRFT T R R R F 


Here in the first line we dispose of the case where n(S) = T. In the 
second line we dispose of the case where either v(P) — F or v (Q) = F. 
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The third line incorporates the two remaining possibilities; here R is the 
truth value assigned to R and R is the opposite value. 

For the above example it is possible to see directly why it is a tautol¬ 
ogy. The stronger the antecedent (the expression on the left side), the 
weaker the conditional. Thus 

(PAQ)N P, 

(P —R) h ((PAQ) —R), 

(((P A Q) —► R) — S) |= ((P — R) — S). 

The problem of developing effective procedures that reduce the 
tedium is important for theorem proving by machine. Some of the pro¬ 
grams here may require testing wffs of sentential logic having thousands 
of sentence symbols. Truth tables are far too cumbersome for anything 
this large. The problem of developing highly efficient methods is a cur¬ 
rent area of research in computer science. 

Digression. Applying the truth-table method — carried out in 
full — to a wff with n sentence symbols requires making a table of 
2" lines. The trouble is, as n increases, 2" grows “exponentially.” For 
example, suppose you can generate the table at the rate of a million lines 
per second. (We are supposing you are aided by a computer, of course.) 
Then if n = 80, say, you will need “merely” 2 80 microseconds to form 
the complete table. Flow long is that? Converted to years, 2 80 microsec¬ 
onds is about 38 billion years. For comparison, the age of the universe 
is around 15 billion years. The conclusion is that 2 80 microseconds is 
more time than there has been time! 

Is there a faster method? Might there be some general method that, 
given any wff a with n sentence symbols, will determine whether or 
not a is a tautology in only 10 n 5 microseconds (or some other function 
of n that grows like a polynomial instead of growing exponentially)? 
(For n = 80, 10n 5 microseconds comes to only 9 hours.) The answer 
to this question is unknown, but it is widely believed that the answer is 
negative. The question is called the “P versus NP” problem, the most 
prominent open problem in theoretical computer science today. 


A Selected List of Tautologies 

1. Associative and commutative laws for A, V, •*-». 

2. Distributive laws: 

((A A (B V C)) ((A A B) V (A A C»). 
((A V (B A C)) <-► ((A V B) A (A V C))). 
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3. Negation: 

A)) <-A). 

((-- (A —B))«-(AA(-.B))). 

((-- (A <- B)) ((A A (-. B)) V ((-. A) A B))). 

De Morgan’s laws: 

((n(AAB))«((nA)V(nB))). 

((“ 1 (A V B)) ((—i A) A (—1B))). 


4. Other 


Excluded middle: 
Contradiction: 
Contraposition: 
Exportation: 


(A V (-i A)). 

(-i (A A (-i A))). 

((A —B) <-((-. B) — (-. A))). 

(((A A B) -v C) (A — (B -v C))). 


Exercises 

1. Show that neither of the following two formulas tautologically im¬ 
plies the other: 

(A <-► (B <-+ C)), 

((A A (B A C)) V ((-. A) A ((-. B) A (-i C)))). 

Suggestion: Only two truth assignments are needed, not eight. 

2. (a) Is (((P —» Q) —*• P) —» P) a tautology? 

(b) Define aj recursively as follows: a () = (P —<• Q) and ety+i = 
(Ok —* P). For which values of k is er^ a tautology? (Part (a) 
corresponds to k — 2.) 

3. (a) Determine whether or not ((P—> Q) V (Q—>• P)) is a tautology, 
(b) Determine whether or not ((P A Q) —► R) tautologically implies 

((P R) V (Q R)). 

4. Show that the following hold: 

(a) E; a |= iff E ^ (a ->• 0). 

(b) a\==\ P iff N (« ^ PI 

(Recall that E; a = EU{a}, the set E togetherwith the one possibly 
new member a.) 

5. Prove or refute each of the following assertions: 

(a) If either E |= a or E |= P, then E |= (a V P). 

(b) If E |= (a V P), then either E |= a or E | = p. 

6 . (a) Show that if V\ and V 2 are truth assignments which agree on all 

the sentence symbols in the wff a, then Pita) = V 2 (a). Use 
the induction principle. 
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(b) Let S be a set of sentence symbols that includes those in E and 
r (and possibly more). Show that E |= r iff every truth assign¬ 
ment for S which satisfies every member of E also satisfies r. 
(This is an easy consequence of part (a). The point of part (b) 
is that we do not need to worry about getting the domain of a 
truth assignment exactly perfect, as long as it is big enough. For 
example, one option would be always to use truth assignments 
on the set of all sentence symbols. The drawback is that these 
are infinite objects, and there are a great many — uncountably 
many — of them.) 

7. You are in a land inhabited by people who either always tell the 
truth or always tell falsehoods. You come to a fork in the road 
and you need to know which fork leads to the capital. There is a 
local resident there, but he has time only to reply to one yes-or-no 
question. What one question should you ask so as to learn which 
fork to take? Suggestion: Make a table. 

8 . (Substitution) Consider a sequence oq, aq, . . . of wffs. For each wff 
q> let (p* be the result of replacing the sentence symbol A„ by a n , 
for each n. 

(a) Let v be a truth assignment for the set of all sentence symbols; 
define u to be the truth assignment for which u{ A„) = v(a„). 
Show that u((p) = v(yp*). Use the induction principle. 

(b) Show that if q> is a tautology, then so is cp*. (For example, one 

of our selected tautologies is ((A A B) (B A A)). From this 
we can conclude, by substitution, that ((a A /3) (/S A a)) is 

a tautology, for any wffs a and /3.) 

9. (Duality) Let a be a wff whose only connective symbols are A, V, 
and - 1 . Let a* be the result of interchanging A and V and replacing 
each sentence symbol by its negation. Show that a* is tautologically 
equivalent to (-1 a). Use the induction principle. 

Remark: It follows that if a |==] /3 then a* |==] /3*. 

10. Say that a set E 1 of wffs is equivalent to a set E 2 of wffs iff for any 
wff a, we have Ei |= a iff E 2 |= a. A set E is independent iff no 
member of E is tautologically implied by the remaining members 
in E. Show that the following hold. 

(a) A finite set of wffs has an independent equivalent subset. 

(b) An infinite set need not have an independent equivalent subset. 
*(c) Let E = {<T ( ). <T],...}; show that there is an independent equiv¬ 
alent set E'. (By part (b), we cannot hope to have E' C E in 
general.) 

11. Show that a truth assignment v satisfies the wff 

(••• (Ai <-► A 2 ) A„) 
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iff v(A ,-) = F for an even number of i’s, 1 < i < n. (By the 
associative law for •*->, the placement of the parentheses is not 
crucial.) 

12. There are three suspects for a murder: Adams, Brown, and Clark. 
Adams says “I didn’t do it. The victim was an old acquaintance of 
Brown’s. But Clark hated him.” Brown states “I didn’t do it. I didn’t 
even know the guy. Besides I was out of town all that week.” Clark 
says “I didn’t do it. I saw both Adams and Brown downtown with 
the victim that day; one of them must have done it.” Assume that 
the two innocent men are telling the truth, but that the guilty man 
might not be. Who did it? 

13. An advertisement for a tennis magazine states, “If I’m not playing 
tennis. I’m watching tennis. And if I’m not watching tennis, I’m 
reading about tennis.” We can assume that the speaker cannot do 
more than one of these activities at a time. What is the speaker 
doing? (Translate the given sentences into our formal language; 
consider the possible truth assignments.) 

14. Let S be the set of all sentence symbols, and assume that i> : S -> 
{ F, 7'| is a truth assignment. Show there is at most one extension 
v meeting conditions 0-5 listed at the beginning of this section. 
(Suppose that 17 1 and TL are both such extensions. Use the induction 
principle to show that Ui — V 2 -) 

15. Of the following three formulas, which tautologically imply which? 

(a) (A <- B) 

(b) (-.((A —B) —(-.(B —A)))) 

(c) (((“i A) V B) A (A V (— 1 B))) 

SECTION 1.3 _ 

A Parsing Algorithm 

The purpose of this section is to prove that we have used enough paren¬ 
theses to eliminate any ambiguity in analyzing wffs. (The existence 
of the extension 17 of a truth assignment v will hinge on this lack of 
ambiguity. 1 ) 

It is instructive to consider the result of not having parentheses at all. 
The resulting ambiguity is illustrated by the wff 

Ai V A 2 A A 3 , 


1 The reader who has already accepted the existence of v may omit almost all of this 
section. The final subsection, on omitting parentheses, will still be needed. 
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which can be formed in two ways, corresponding to ((Aj V A 2 ) A A 3 ) 
and to (A! V (A 2 A A 3 )). If u(Ai) = T and 1 TA 3 ) = F, then there is an 
unresolvable conflict that arises in trying to compute U(Ai V A 2 A A 3 ). 

We must show that with our parentheses this type of ambiguity does 
not arise but that on the contrary each wff is formed in a unique way. 
There is one sense in which this fact is unimportant: If it failed, we 
would simply change notation until it was true. For example, instead 
of building formulas by means of concatenation, we could have used 
ordered pairs and triples: (- 1 , a) and (a, A, /l), and so forth. (This is, in 
fact, a tidy, but untraditional, method.) It would then be immediate that 
formulas had unique decompositions. But we do not have to resort to 
this device, and we will now prove that we do not. 

Lemma 13 A Every wff has the same number of left as right paren¬ 
theses. 

Proof. This was done as an example at the end of Section 1.1. H 

Lemma 13B Any proper initial segment of a wff contains an excess 
of left parentheses. Thus no proper initial segment of a wff can 
itself be a wff. 

Proof. We apply the induction principle to the set S of wffs pos¬ 
sessing the desired property (that proper initial segments are left- 
heavy). A wff consisting of a sentence symbol alone has no proper 
initial segments and hence is in S vacuously. To verify that S is 
closed under £ A , consider a and ft in S. The proper initial seg¬ 
ments of (a A ft) are the following: 

1 . (■ 

2 . («o. where a 0 is a proper initial segment of a. 

3. (a. 

4. (aA. 

5. {a A fto, where fto is a proper initial segment of ft. 

6 . (ce Aft. 

By applying the inductive hypothesis that a and ft are in S 
(in cases 2 and 5), we obtain the desired conclusion. For closure 
under the other four formula-building operations, the argument is 
similar. H 


A Parsing Algorithm 

We are now going to describe a procedure that, given an expression, 
both will determine whether or not the expression is a legal wff, and if 
it is, will construct the tree showing how it was built up from sentence 
symbols by application of the formula-building operations. Moreover, 
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we will see that this tree is uniquely determined by the wff. It is this 
last fact that will assure us that we have enough parentheses for an 
unambiguous notation. 

Assume then that we are given an expression. We will construct a tree, 
with the given expression at the top. Initially it is the only vertex in the 
tree, but as the procedure progresses, the tree will grow downward from 
the given expression. For an example, imagine the tree in Section 1.1. 

The algorithm consists of the following four steps: 

1. If all minimal vertices (the ones at the bottom) have sentence 
symbols, then the procedure is completed. (The given expression is 
indeed a wff, and we have constructed its tree.) Otherwise, select a 
minimal vertex having an expression that is not a sentence symbol. We 
examine that expression. 

2. The first symbol must* be (. If the second symbol is the negation 
symbol, jump to step 4. Otherwise, go on to step 3. 

3. Scan the expression from the left until first reaching (a, where 
a is a nonempty expression having a balance between left and right 
parentheses.** Then a is the first of the two constituents. The next 
symbol must* be A, V, —►, or •*->. This is the principal connective. The 
remainder of the expression, /?), must* consist of an expression p and a 
right parenthesis. We extend the tree by creating two new vertices below 
the present one, with a as the expression at the “left child” vertex, and 
p as the expression at the “right child” vertex. Return to step 1. 

4. The first two symbols are now known to be (->. The remainder 
of the expression, P), must* consist of an expression P and a right 
parenthesis. We extend the tree by creating one new vertex below the 
present one, with p as the expression at the child vertex. Return to step 1. 

Now for some comments in support of the correctness of this algo¬ 
rithm. 

First, given any expression, the procedure halts after a finite number 
of steps. This is because any vertex contains a shorter expression than 
the one above it, so the depth of the tree is bounded by the length of the 
given expression. 

Secondly, the choices made by the procedure could not have been 
made differently. For example, in step 3 we arrive at an expression a. We 
could not use less than a for a constituent, because it would not have a 
balance between left and right parentheses (as required by Lemma 13A). 


* If not, then the expression here is not a wff. We reject the given expression as a non-wff, 
and halt. 

* If the end of the expression is reached before finding such an a, then the expression 
here is not a wff. We reject the given expression as a non-wff, and halt. 
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We could not use more than a, because that would have the proper initial 
segment a that was balanced (violating Lemma 13B). Thus a is forced 
upon us. And then the choice of the principal connective is inevitable. 
We conclude that this algorithm constructs the only possible tree for the 
given expression. 

Thirdly, if the algorithm uses the footnotes to reject the given expres¬ 
sion, then that expression could not have been a wff—the rejection is 
correct. That is because the only possible attempt to make its tree failed. 

Finally, if the algorithm does not use the rejection footnotes, then the 
given expression is indeed a legal wff. That is because we have its tree; 
by working our way up the tree, we find inductively that every vertex 
has a wff, including the top vertex. 

The second remark above lets us conclude that our language has 
enough parentheses; each wff has a unique tree of the sort constructed 
here. We have “unique readability.” And not only does a unique tree exist 
for a wff, we know how to construct it; we can carry out this algorithm, 
using a sufficiently large piece of paper. 

Now to go back to the question of the existence of the extension 
v of a truth assignment v: It is the uniqueness of the trees that is the 
crucial fact here. For any wff (p there is a unique tree constructing it. 
By working our way up this tree, assigning a truth value v(a) at each 
vertex a, we can unambiguously arrive at a value for v(<p). And the 
function described in this way meets conditions 0-5 listed at the start 
of Section 1.2. And not only that, we know how, given (p and the values 
of v at its sentence symbols, to carry out the calculation of v(tp). 

Thus, using the parsing algorithm, we can make a function v as 
described in Theorem 12A. And there can be only one such v; cf. Ex¬ 
ercise 14 in Section 1.2. 

The only reason the existence of v is an issue at all is that in Sec¬ 
tion 1.2 it is described by recursion. That is, v((p) is specified by making 
use of the function v itself, applied to smaller formulas. In the next 
section, we approach the matter of defining a function by recursion 
more generally. By treating the subject more abstractly, we can better 
isolate what is at stake, 

Polish Notation 

It is possible to avoid both ambiguity and parentheses. This can be done 
by a very simple device. Instead of, for example, (a A P) we use Aa/L 
Let the set of P-wffs be the set generated from the sentence symbols by 
the five operations 

TL,(a) = —i a, 2? V (<A P) — Va/1, 

V/^ia, P) — A a/3, V^(a, P) — — >aP, 

V^(a, P) = ±-^u.p. 



Chapter 1: Sentential Logic 


33 


For example, one P-wff is 

—> A AD V -i B <-» CB. 

Here the need for an algorithm to analyze the structure is quite ap¬ 
parent. Even for the short example above, it requires some thought to 
see how it was built up. We will give a unique readability theorem for 
such expressions in Section 2.3. 

This way of writing formulas (but with N, K , A, C, and E in place 
of -i, A, V, —>, and •*->■, respectively) was introduced by the Polish logi¬ 
cian Lukasiewicz. The notation is well suited to automatic processing. 
Computer compiler programs often begin by converting the formulas 
given them into Polish notation. 

Omitting Parentheses 

Hereafter when naming wffs, we will not feel compelled to mention 
explicitly every parenthesis. To establish a more compact notation, we 
now adopt the following conventions: 

1. The outermost parentheses need not be explicitly mentioned. For 
example, when we write “A A B” we are referring to (A A B). 

2. The negation symbol applies to as little as possible. For example, 
-i A A B is (—1 A) A B, i.e., ((—i A) A B). It is not the same as (-i (A A 
B)). 

3. The conjunction and disjunction symbols apply to as little as pos¬ 
sible, given that convention 2 is to be observed. For example, 

A A B -» -> C V D is ((A A B) —► ((—■ C) V D)). 

4. Where one connective symbol is used repeatedly, grouping is to 
the right: 

ffA^Ay is a A (/3 A y), 
a —► —► y is a —► (fi —► y). 

It must be admitted that these conventions violate what was said ear¬ 
lier about naming expressions. We can get away with this only because 
we no longer have any interest in naming expressions that are not wffs. 


Exercises 

1. Rewrite the tautologies in the “selected list” at the end of Section 1.2, 
but using the conventions of the present section to minimize the 
number of parentheses. 

2. Give an example of wffs a and f3 and expressions y and <5 such that 
(a A ji) — (y A 8) but a ^ y. 
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3. Carry out the argument for Lemma 13B for the case of the operation 

4. Suppose that we modify our definition of wff by omitting all right 
parentheses. Thus instead of 

((A A (—1B)) —> (C V D)) 

we use 

((A A (-i B —> (C V D. 

Show that we still have unique readability (i.e., each wff still has 
only one possible decomposition). Suggestion: These expressions 
have the same number of parentheses as connective symbols. 

5. The English language has a tendency to use two-part connectives: 
“both ... and ...” “either ... or ...” “if ..., then ....” How does 
this affect unique readability in English? 

6 . We have given an algorithm for analyzing a wff by constructing its 
tree from the top down. There are also ways of constructing the tree 
from the bottom up. This can be done by looking through the formula 
for innermost pairs of parentheses. Give a complete description of 
an algorithm of this sort. 

7. Suppose that left and right parentheses are indistinguishable. Thus, 
instead of (a V (/3 A y)) we have \a V |/3 A y\\. Do formulas still 
have unique decomposition? 


SECTION 1.4 _ 

Induction and Recursion 1 

Induction 

There is one special type of construction that occurs frequently both in 
logic and in other branches of mathematics. We may want to construct 
a certain subset of a set U by starting with some initial elements of U, 
and applying certain operations to them over and over again. The set we 
seek will be the smallest set containing the initial elements and closed 
under the operations. Its members will be those elements of U which 
can be built up from the initial elements by applying the operations 
some finite number of times. 


1 On the one hand, the concepts in this section are important, and they arise in many 
places throughout mathematics. On the other hand, readers may want to postpone — not 
skip — study of this section. 
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In the special case of immediate interest to us, U is the set of expres¬ 
sions, the initial elements are the sentence symbols, and the operations 
are ££ A , etc. The set to be constructed is the set of wffs. But we will 
encounter other special cases later, and it will be helpful to view the 
situation abstractly here. 

To simplify our discussion, we will consider an initial set B C U 
and a class T of functions containing just two members / and g, where 

/ : U xU U and g : U -* U. 

Thus / is a binary operation on U and g i s a unary operation. (Actu¬ 
ally T need not be finite; it will be seen that our simplified discussion 
here is, in fact, applicable to a more general situation. T can be any 
set of relations on U , and in Chapter 2 this greater generality will be 
utilized. But the case discussed here is easier to visualize and is gen¬ 
eral enough to illustrate the ideas. For a less restricted version, see 
Exercise 3.) 

If B contains points a and h. then the set C we wish to construct will 
contain, for example, 

b, fib , b ), g(a), f{g{a), fib , b)), gifigia ), fib , b ))). 

Of course these might not all be distinct. The idea is that we are given 
certain bricks to work with, and certain types of mortar, and we want C 
to contain just the things we are able to build. 

In defining C more formally, we have our choice of two definitions. 
We can define it “from the top down” as follows: Say that a subset S of 
U is closed under / and g iff whenever elements x and y belong to S, 
then so also do fix, y ) and gix). Say that S is inductive iff B C S and 
S is closed under / and g. Let C* be the intersection of all the inductive 
subsets of U ; thus x e C* iff .r belongs to every inductive subset of 
U. It is not hard to see (and the reader should check) that C* is itself 
inductive. Furthermore, C* is the smallest such set, being included in 
all the other inductive sets. 

The second (and equivalent) definition works “from the bottom up.” 
We want C* to contain the things that can be reached from B by applying 
/ and g a finite number of times. Temporarily define a construction 
sequence to be a finite sequence (jci , ..., x„) of elements of U such that 
for each i < n we have at least one of 

xi e B, 

Xj — f(Xj, Xk ) for some j < i,k < i, 
x, — gixj ) for some j < i. 

In other words, each member of the sequence either is in B or results 
from earlier members by applying / or g. Then let C* be the set of all 
points x such that some construction sequence ends with x. 
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Let C n be the set of points x such that some construction sequence 
of length n ends with x. Then C\ — B, 

Ci c C 2 c C 2 c • • •, 

and C* = U„ C n . For example, g(f{a, fib , b ))) is in C$ and hence in 
C*, as can be seen by contemplating the tree shown: 

g{f {a J{b, b ))) 
fiajib, b)) 



a fib, b) 



b b 


We obtain a construction sequence for gif (a, fib, b))) by squashing 
this tree into a linear sequence. 

Examples 

1. The natural numbers. Let U be the set of all real numbers, 
and let B — {0}. Take one operation 5, where S(x) = x +1. Then 

C* = {0,1,2,...}. 

The set C* of natural numbers contains exactly those numbers 
obtainable from 0 by applying the successor operation repeatedly. 

2. The integers. Let U be the set of all real numbers; let B — 
{0}. This time take two operations, the successor operation 5 and 
the predecessor operation P: 

Six) = x + 1 and P(x) = x — 1. 

Now C* contains all the integers, 

C* = {..., -2, -1,0, 1,2,...}. 

Notice that there is more than one way of obtaining 2 as a member 
of C*. For 2 is 5(5(0)), but it is also 5(F(5(5(0))». 

3. The algebraic functions. Let U contain all functions whose 
domain and range are each sets of real numbers. Let B contain 
the identity function and all constant functions. Let T contain the 
operations (on functions) of addition, multiplication, division, and 
root extraction. Then C* is the class of algebraic functions. 

4. The well-formed formulas. Let U be the set of all expres¬ 
sions and let B be the set of sentence symbols. Let T contain the 
five formula-building operations on expressions: £-,, £ A , £ v , £^>, 
and . Then C* is the set of all wffs. 
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At this point we should verify that our two definitions are actually 
equivalent, i.e., that C* — C*. 

To show that C* C C* we need only check that C* is inductive, i.e., 
that B C C* and C* is closed under the functions. Clearly B = C\ C C*. 
If x and y are in C*, then we can concatenate their construction sequences 
and append fix, y) to obtain a construction sequence placing f(x, y) 
in C*. Similarly, C„ is closed under g. 

Finally, to show that C* C C* we consider a point in C* and a 
construction sequence {xq,..., x n ) for it. By ordinary induction on i, 
we can see that x-, e C*, i < n. First xq e B C. C*. For the inductive 
step we use the fact that C* is closed under the functions. Thus we 
conclude that 

C„ — C* — C* — P|{5 | S is inductive}. 

n 

(A parenthetical remark: Suppose our present study is embedded in 
axiomatic set theory, where the natural numbers are usually defined 
from the top down. Then our definition of C* (employing finiteness and 
hence natural numbers) is not really different from our definition of C*. 
But we are not working within axiomatic set theory; we are working 
within informal mathematics. And the notion of natural number seems 
to be a solid, well-understood intuitive concept.) 

Since C* — C*, we call the set simply C and refer to it as the set 
generated from B by the functions in T. We will often want to prove 
theorems by using the following: 

Induction Principle Assume that C is the set generated from B by 
the functions in T. If S is a subset of C that includes B and is 
closed under the functions of T then S = C. 

Proof. S is inductive, so C — C* c. S. We are given the other 
inclusion. H 

The special case now of interest to us is, of course. Example 4. Here 
C is the class of wffs generated from the set of sentence symbols by the 
formula-building operations. This special case has interesting features 
of its own. Botha and ft are proper segments of £/\(a, f), i.e., of (a A ft). 
More generally, if we look at the family tree of a wff, we see that each 
constituent is a proper segment of the end product. 

(As v (A 8 «-*■ A 3 » 

/ \ 

A 3 (A* *-* A 3 ) 
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Suppose, for example, that we temporarily call an expression spe¬ 
cial if the only sentence symbols in it are among {A2, A3, A5} and the 
only connective symbols in it are among {—1, —»}. Then no special wff 
requires A9 or £,, for its construction. In fact, every special wff belongs 
to the set C s generated from {A2, A3, Ag) by £- and £_>. (We can use 
the induction principle to show that every wff either belongs to C s or is 
not special.) 

Recursion 

We return now to the more abstract case. There is a set U (such as the 
set of all expressions), a subset B of U (such as the set of sentence 
symbols), and two functions / and g, where 

/ :U xU U and g : U U. 

C is the set generated from B by f and g. 

The problem we now want to consider is that of defining a function 
on C recursively. That is, we suppose we are given 

1 . Rules for computing h(x) for x e B. 

2 a. Rules for computing h(f(x, y)), making use of h(x) and h(y). 
2 b. Rules for computing h(g(x)), making use of h(x ). 

(For example, this is the situation discussed in Section 1 . 2 , where h is 
the extension of a truth assignment for B.) It is not hard to see that there 
can be at most one function h on C meeting all the given requirements. 

But it is possible that no such h exists; the rules may be contradictory. 
For example, let 

U — the set of real numbers, 

B = { 0 }, 

fix, y) — x • y, 

g(x ) =x+l- 

Then C is the set of natural numbers. Suppose we impose the following 
requirements on h : 

1 . h{ 0 ) = 0 . 

2a. hifix,y)) = f(hix),hiy)). 

2 b. higix)) = hix) + 2. 

Then no such function h can exist. (Try computing hi I), noting that we 
have both 1 = g(0) and 1 = f(g(0), g(0)).) 

A similar situation is encountered in algebra. 2 Suppose that you 
have a group G that is generated from B by the group multiplication 


2 It is hoped that examples such as this will be useful to the reader with some algebraic 
experience. The other readers will be glad to know that these examples are merely 
illustrative and not essential to our development of the subject. 
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and inverse operation. Then an arbitrary map of B into a group H is 
not necessarily extendible to a homomorphism of the entire group G 
into H. But if G happens to be a free group with set B of independent 
generators, then any such map is extendible to a homomorphism of the 
entire group. 

Say that C is freely generated from B by f and g iff in addition to 
the requirements for being generated, the restrictions fc and gc of / 
and g to C meet the following conditions: 

1. fc and gc are one-to-one. 

2 . The range of fc, the range of gc, and the set B are pairwise 
disjoint. 

The main result of this section, the recursion theorem, says that if C 
is freely generated then a function h on B always has an extension h on 
C that follows the sorts of rules considered above. 

Recursion Theorem Assume that the subset C of U is freely gen¬ 
erated from B by f and g, where 

f :U xU U, 

g : U U. 

Further assume that V is a set and F , G, and h functions such that 

h : B V, 

F : V x V -> V, 

G : V -> V. 

Then there is a unique function 

h:C -> V 


such that 

(i) For x in B, h(x) — h(x)\ 

(ii) For x, y in C, 

Hf(x,y))= F(h(x)My)), 

h(g(x)) = G(h(x)). 

Viewed algebraically, the conclusion of this theorem says that any 
map h of B into V can be extended to a homomorphism h from C (with 
operations / and g) into V (with operations F and G). 

If the content of the recursion theorem is not immediately apparent, 
try looking at it chromatically. You want to have a function h that paints 
each member of C some color. You have before you 

1 . h, telling you how to color the initial elements in B ; 

2 . F, which tells you how to combine the color of x and y to obtain 
the color of /(v, y) (i.e,, it gives h(f(x, y)) in terms of h(x) and h{y))\ 
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3. G, which similarly tells you how to convert the color of x into the 
color of g(x). 

The danger is that of a conflict in which, for example, F is saying 
“green” but G is saying “red” for the same point (unlucky enough to 
be equal both to f(x, y) and g(z) for some x, y, z). But if C is freely 
generated, then this danger is avoided. 

Examples. Consider again the examples of the preceding subsection. 

1. B — {0} with one operation, the successor operation S. 
Then C is the set N of natural numbers. Since the successor oper¬ 
ation is one-to-one and 0 is not in its range, C is freely generated 
from {0} by S. Therefore, by the recursion theorem, for any set 
V, any a e V, and any F : V —> V there is a unique h : N —»■ V 
such that h( 0) = a and h(S(x)) — F(h(x )) for each a e N. 
For example, there is a unique h : N -> N such that /i(0) = 0 
and h(S(x)) — 1 — h(x). This function has the value 0 at even 
numbers and the value 1 at odd numbers. 

2. The integers are generated from {0} by the successor and 
predecessor operations but not freely generated. 

3. Freeness fails also for the generation of the algebraic func¬ 
tions in the manner described. 

4. The wffs are freely generated from the sentence symbols by 
the five formula-building operations. This fact is implicit in the 
parsing algorithm of the preceding section; we now want to focus 
on it here: 

Unique Readability Theorem The five formula-building operations, 
when restricted to the set of wffs, 

(a) Have ranges that are disjoint from each other and from the 
set of sentence symbols, and 

(b) Are one-to-one. 

In other words, the set of wffs is freely generated from the set 
of sentence symbols by the five operations. 

Proof. To show that the restriction of £ A is one-to-one, suppose 
that 


(a A >3) = (y A 5), 

where a, fi, y, and S are wffs. Delete the first symbol of each 
sequence, obtaining 

a A ft) = y A 8). 

Then we must have a — y, lest one be a proper initial segment of 
the other (in contradiction to Lemma 13B). And then it follows at 
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once that ft = S. The same argument applies to £ v , £—>, and £^\ 
for E-, a simpler argument suffices. 

A similar line of reasoning tells us that the operations have 
disjoint ranges. For example, if 

(a A ft) = (y —■> 5) 

where a, ft, y, and S are wffs, then as in the above paragraph 
we have a = y. But that implies that A = —>, contradicting 
the fact that our symbols are distinct. Hence £ t , and (when 
restricted to wffs) have disjoint ranges. Similarly for any two 
binary connectives. 

The remaining cases are simple. If (—ia) — (ft A y ), then 
ft begins with -i, which no wff does. No sentence symbol is a 
sequence of symbols beginning with (. H 

Now let us return to the question of extending a truth assignment v 
to v. First consider the special case where v is a truth assignment for 
the set of all sentence symbols. Then by applying the unique readability 
theorem and the recursion theorem we conclude that there is a unique 
extension v to the set of all wffs with the desired properties. 

Next take the general case where v is a truth assignment for a set S 
of sentence symbols. The set S generated from S by the five formula¬ 
building operations is freely generated, as a consequence of the unique 
readability theorem. So by the recursion theorem there is a unique ex¬ 
tension v of v to that set, having the desired properties. 

Example. We can apply the recursion theorem to establish that there 
is a unique function ft defined on the set of wffs such that 

ft (A) = 1 for a sentence symbol A, 
ft ((—i a)) = 3 + h{a), 

Ti((a Aft)) = 3 + Ti(a) + h(ft), 

and similarly for V, —», and •*->. This function gives the length of 
each wff. 

Proof of the Recursion Theorem. The idea is to let ft be the union 
of many approximating functions. Temporarily call a function v 
(which maps part of C into V ) acceptable if it meets the conditions 
imposed on ft by (i) and (ii). More precisely, v is acceptable iff 
the domain of v is a subset of C, the range a subset of V, and for 
any a and y in C: 

(i') If a belongs to B and to the domain of v, then v (x ) = h(x). 
(ii') If f(x, y ) belongs to the domain of v, then so do a and y, 
and u(/( a, y)) = F(v(x), u(y)). If g(x) belongs to the domain 
of v, then so does a, and v(g(x)) = G( v(x)). 
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Let K be the collection of all acceptable functions, and let h — 
(J K, the union of all the acceptable functions. Thus 

(x, z) e h iff (x, z) belongs to some acceptable v 

iff v(x) = z for some acceptable v. " " ' 

We claim that h meets our requirements. The argument is set- 
theoretic, and comprises four steps. First, here is an outline of 
four steps: 

1. We claim that h is a function (i.e., that it is single-valued). 
Let 

S — {x e C | for at most one z, (x, z) e h) 

— {x e C | all acceptable functions defined at x agree 
there} 

It is easy to verify that S is inductive, by using (i') and (ii'). Hence 
S = C and h is a function. 

2. We claim that h e K; i.e., that h itself is an acceptable 
function. This follows fairly easily from the definition of h and 
the fact that it is a function. 

3. We claim that h is defined throughout C. It suffices to show 
that the domain of h is inductive. It is here that the assumption of 
freeness is used. For example, one case is the following: Suppose 
that x is in the domain of h. Then h; (g(x), G(h(x))) is accept¬ 
able. (The freeness is required in showing that it is acceptable.) 
Consequently, g (x) is in the domain of /?. 

4. We claim that h is unique. For given two such functions, 

let S be the set on which they agree. Then S is inductive, and so 
equals C. H 

Now for the details. 

1. As above, let 

S — {x e C | for at most one z, (x, z) e h) 

— {x e C | all acceptable functions defined at x agree there} 

Toward showing that S is inductive, first consider some x in B. 
Suppose that iq and ih are acceptable functions defined at x; we 
seek to show that V\ (x) — V2M. But condition (i') tells us that 
both t>i(x) and ih(x) must equal h{x), so indeed i>i(x:) = V 2 (x). 
This shows that x e S\ since x was an arbitrary member of B we 
have B c S. 

Secondly we must check that S is closed under / and g. So 
suppose that some x and y are in S\ we ask whether fix, y ) is in 

S. So suppose that V\ and ih are acceptable functions defined at 
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fix, v); we seek to show that they agree there. But condition (ii') 
tells us that v\(f(x,y)) — F{v\fx), i>i(y)) and t> 2 if fx,y)) = 
F(v 2 (x), V 2 (y)). And because x and y are in S, we have i>i(x) = 
i> 2 (x) and Ui(y) = i> 2 (y) (and these are defined). So we conclude 
that v\iffx,y)) = V 2 (f(x,y)). This shows that f(x,y ) e S. 
Hence S is closed under /. A similar argument shows that S is 
closed under g. 

Thus S is inductive and so S = C. This shows that h is single¬ 
valued, i.e., is a function. Because h includes every acceptable 
function as a subset, we can say that 

h{x) — v(x) 

whenever v is an acceptable function and x e dom v. 

2. We claim that h is acceptable. Clearly dom h C C and 
ran h C V (by (*)). and we have just verified that h is a function. 
It remains to check that h satisfies conditions (i') and (ii'). 

First we examine (i'). Assume x e B and x e dom h (so that 

( x,h(x)) e /O.Theremustbesomeacceptablei;withi)(x) = /K.v)- 

Because v satisfies (i'), wehave vfx) = h(x) whence h(x) — hfx). 
So h satisfies (i'). 

Secondly we examine (ii'). Assume that f(x,y ) e dom/ 7 . 
Again there must be some acceptable v with v(f(x,y)) = 
h(f(x,y)). Because v satisfies (ii'), we have u(/(x,y)) = 
F(v(x), v(y)). Now h(x) = vfx) and hfy) = v(y) and hence 

h( fix, y)) = vifix, >’)) = F(v(x), v(y)) = F(h(x), hfy)). 

In a similar way, we find that h(g(x)) = G(hfx)) whenever 
gfx) e dom/7. Hence h meets condition (ii') and so is accept¬ 
able. 

3. Next we must show that dom h is inductive. First consider 
a point x in B. Then the set {{x, hfx))} is a (small) acceptable 
function. For it clearly satisfies (i'). It also satisfies (ii') because 
x f ran /c and x f range- Thus {(x, hfx))} is acceptable and 
therefore is included in h. Hence x e domh. This shows that 
B C domh. 

We further claim that domh is closed under / and g. Toward 
this end, consider any s and t in domh. We hope that f{s, t) e 
domh. But if not, then let 

v = hU{(f(s,t),F(h(s),hm}, 

the result of adding this one additional pair to h. It is clear that v is 
a function, dom v c C, and ran v c V. We claim that v satisfies 
(i') and (ii'). 
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First take (i'). If x e fin dom v then x ^ f(s, t), by freeness. 
Hence x e dom/z and we have v(x) = h(x) — h(x). 

Next take (ii'). Assume that fix, y) e domw for some x and 
y in C. If f(x,y) e dom h then v(f(x,y)) = h{f(x,y)) — 
F(h{x), h(y )) = F(v(x), u(y)) since h is acceptable. The other 
possibility is that fix, y) = f(s, t). Then by freeness we have 
x — s and y = t, and we know that these points are in dom/z c 
dom v. By construction, 

i >(f(s,t)) = F(h(s),h(t)) 

= F(v(s), v(t)). 

Finally suppose that g(x) e dom v for x in C. Then by freeness 
we have g(x) f Hence g(x) e dom/z, and consequently 

»>(g(*)) = higix)) = G(h{x)) = G(v(x)). 

Thus v is acceptable. But that tells us that v C h, so that 
f(s,t) e dom/z after all. 

A similar argument shows that dom h is closed under g as well. 
Hence dom/z is inductive and therefore coincides with C. 

4. To show that h is unique, suppose that h i and /z 2 both satisfy 
the conclusion of the theorem. Let S be the set on which they agree: 

S — [x e C | h\{x) = /z 2 (x)}. 

Then it is not hard to verify that S is inductive. Consequently 
S — C and hi = /z 2 . H 

One final comment on induction and recursion: The induction prin¬ 
ciple we have stated is not the only one possible. It is entirely possible to 
give proofs by induction (and definitions by recursion) on the length of 
expressions, the number of places at which connective symbols occur, 
etc. Such methods are inherently less basic but may be necessary in 
some situations. 


Exercises 

1. Suppose that C is generated from a set B — [a, h\ by the binary 
operation f and unary operation g. List all the members of C 2 . How 
many members might C3 have? C4? 

2 . Obviously (A3 —* AA4) is not a wff. But prove that it is not a wff. 

3. We can generalize the discussion in this section by requiring of T 
only that it be a class of relations on U. C* is defined as before, 
except that [x$, x\, ..., x n ) is now a construction sequence provided 
that for each i < n we have either x,- e B or (x ; -,,..., Xj k , x ( ) e R 
for some R e T and some j\,, ji, all less than i. Give the correct 
definition of C* and show that C* = C*. 
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SECTION 1.5 _ 

Sentential Connectives 

We have thus far employed five sentential connective symbols. Even 
in the absence of a general definition of “connective,” it is clear that 
the five familiar ones are not the only ones possible. Would we gain 
anything by adding more connectives to the language? Would we lose 
anything by omitting some we already have? 

In this section we make these questions precise and give some an¬ 
swers. First consider an informal example. We could expand the lan¬ 
guage by adding a three-place sentential connective symbol #, called 
the majority symbol. We allow now as a wff the expression ( #a/3y ) 
whenever a, f, and y are wffs. In other words, we add a sixth formula¬ 
building operator to our list: 

£#(a, P, Y) = ( #uPy )• 

Then we must give the interpretation of this symbol. That is, we must 
say how to compute vifttafy)), given the values v(a), i J(/3), and v(y). 
We choose to define 

U((#a/3y)) is to agree with the majority of v(a),jJ(jB),U(y). 

We claim that this extension has gained us nothing, in the following 
precise sense: For any wff in the extended language, there is a tauto¬ 
logically equivalent wff in the original language, (On the other hand, 
the wff in the original language may be much longer than the wff in 
the extended language.) We will prove this (in a more general situa¬ 
tion) below; here we just note that it relies on the fact that ( #afiy ) is 
tautologically equivalent to 

(a A P) V (a A y) V (fi A y). 

(We note parenthetically that our insistence that viifafy)) be cal¬ 
culable from (Ufa), T(/)), v(y)) plays a definite role here. In everyday 
speech, there are unary operators like “it is possible that” or “I believe 
that.” We can apply one of these operators to a sentence, producing a 
new sentence whose truth or falsity cannot be determined solely on the 
basis of the truth or falsity of the original one.) 

In generalizing the foregoing example, the formal language will be 
more of a hindrance than a help. We can restate everything using only 
functions. Say that a A:-place Boolean function is a function from { F, T} k 
into {F, T}. (A Boolean function is then anything which is a A:-place 
Boolean function for some k. We stretch this slightly by permitting 
F and T themselves to be 0-place Boolean functions.) Some sample 
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Boolean functions are defined by the equations (where XejF, 7’)) 


I?(X u ...,X n ) = X it 
N(F ) = T, N(T ) = F, 

K(T,T) = T, K(F, X) = K(X, F) = F, 
A(F, F) = F, A(T, X) = A{X, T) = T, 
C(T, F) = F, C(F, X) — C(X, T) = T, 
E(X, X) = T, E(T, F) = E(F, T) = F. 


From a wff a we can extract a Boolean function. For example, if a 
is the wff Ai A A 2 , then we can make a table. Table V. The 2 2 lines 
of the table correspond to the 2 2 truth assignments for {Aj, A 2 }. For 
each of the 2 2 pairs X, we set B U (X) equal to the truth value a receives 
when its sentence symbols are given the values indicated by X. 


TABLE V 


Aj 

a 2 

Ai A A 2 



F 

F 

F 

B a (F , F) = 

F 

F 

T 

F 

BAF, T) = 

F 

T 

F 

F 

BAT, F) = 

F 

T 

T 

T 

BAT , T) = 

T 


In general, suppose that a is a wff whose sentence symbols are at 
most A], ..., A„. We define an n -place Boolean function B" (or just 
B a if n seems unnecessary), the Boolean function realized by a, by 


B"(X ,X n ) — the truth value given to a when 

Aj,..., A„ are given the values X\,... ,X n . 


Or, in other words, B"{X \,..., X n ) = v(a), where v is the truth as¬ 
signment for {Aj, ..., A„} for which v(A,) = X,. Thus B" t comes from 
looking at v(a) as a function of v, with a fixed. 

For example, the Boolean functions listed previously are obtainable 
in this way: 


in _ r>n 

A — , 
n = bL a 


K = Bi lAA2 , 

A = B\ iVA 2 , 

C=bI^ A2 , 

E = B*. 


From these functions we can compose others. For example, 


Bl AlV ^ A2 (Xu X 2 ) = A(N(lf(Xi, X 2 )), 7V(7|(A 1 , X 2 )». 
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( The right-hand side of this equation can be compared with the result of 
putting —i Ai V —i A 2 into Polish notation.) We will shortly come to the 
question whether every Boolean function is obtainable in this fashion. 

As the theorem below states, in shifting attention from wffs to the 
Boolean functions they realize, we have in effect identified tautologi¬ 
cally equivalent wffs. Impose an ordering on {F, 7’} by defining F < T. 
(If F — 0 and 7=1, then this is the natural order.) 

Theorem 15A Let a and p be wffs whose sentence symbols are 

among A], ..., A„. Then 

(a) a |= p iff for all X e {7, T} n , B a (X) < B P (X). 

(b) a f==| P iff B a — Bp. 

(c) \= a iff B a is the constant function with value T. 

Proof of (a), a |= p iff for all 2" truth assignments v for Ai,..., 
A„, whenever v(a) = T, then also v(P) = T. (This is true even if 
the sentence symbols in a and p do not include all of Ai, ..., A„; 
cf. Exercise 6 of Section 1.2.) Thus 

a \= p iff for all 2" assignments v, v(a) — T => v(P) — T , 

iff for all 2" n -tuples X. B n a (X) = T => B n p (X) = T , 

iff for all 2" n -tuples Z, B n a (X) < B n p (X ), 

where F < T. H 

In addition to identifying tautologically equivalent wffs, we have 
freed ourselves from the formal language. We are now at liberty to 
consider any Boolean function, whether it is realized by a wff or not. 
But this freedom is only apparent: 

Theorem 15B Let G be an u-place Boolean function, n > 1. We 
can find a wff a such that G — />’", i.e., such that a realizes the 
function G. 

The primary reason for introducing Boolean functions is to allow us 
to formulate this theorem. The theorem was stated by Emil Post in 1921. 

Proof. Case I: G is the constant function with value F. Let a = 
Ai A -1 Ai. 

Case II: Otherwise there are k points at which G has the value 
T. k > 0. List these: 

Xi = (X n ,X 12 ,...,X ln ), 

X 2 = (X 21 ,X 2 2,...,X 2n ), 

X k = {X kl ,x k2 ,...,x k „). 
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Let 


_ (Aj iff X U = T, 

PlJ \ (-. A j) iff Xjj = F, 

Yi = fit \ A ■ ■ ■ A flint 
0 / = Yi V Y2 v • • • V Yk- 

We claim that G = B". 

At this point it might be helpful to consider a concrete example. 
Let G be the three-place Boolean function as follows: 

G(F, F, F) = F, 

G(F, F, T ) = T, 

G(F, T, F ) = T, 

G(F, T, T) = F, 

G(T, F, F) = T, 

G(T , F, T) = F, 

G(T, T, F) = F, 

G(T, 7, T) = T. 


Then the list of triples at which G assumes the value T has four 
members: 

FFT -i A[ A -i A 2 A A 3i 
FTF -i Ai A A 2 A-i A 3 , 

TFF Ai A i A 2 A i A 3 , 

TTT Ai A A 2 A A 3 . 

To the right of each triple above is written the corresponding 
conjunction y,-. Then a is the formula 

(-. Ai A -1A 2 A A3) V (-1 Aj A A 2 A -1A 3 ) V 
(AjA-iAiA-i A 3 ) V (Ai A A 2 A A 3 ). 

Notice how a lists explicitly the triples at which G assumes the 
value T. 

To return to the proof of the theorem, note first that B"(X{) = T 
for 1 < i < k. (For the truth assignment corresponding to X ,■ 
satisfies y, and hence satisfies a.) On the other hand, only one 
truth assignment for {Aj,..., A,,} can satisfy y,, whence only k 
such truth assignments can satisfy a. Hence B"(X) — F for the 
2" — k other n-tuples Y. Thus in all cases, B"(Y) — G{Y). H 


From this theorem we know that every Boolean function is realiz¬ 
able, Of course the a that realizes G is not unique; any tautologically 
equivalent wff will also realize the same function. It is sometimes of 
interest to choose a to be as short as possible. (In the example done 
above, the wff 


Ai *—y A 2 <—y A 3 


also realizes G.) 
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As a corollary to the above theorem, we may conclude that we have 
enough (in fact, more than enough) sentential connectives. For suppose 
that we expand the language by adding some exotic new sentential con¬ 
nectives (such as the majority connective discussed at the beginning of 
this section). Any wff <p of this expanded language realizes a Boolean 
function B" . By the above theorem we have a wff a of the original lan¬ 
guage such that B” = B". Hence (p and a are tautologically equivalent, 
by Theorem 15A. 

In fact, the proof shows that a can be of a rather special form. For 
one thing, the only sentential connective symbols in a are A, V, and 
-i. Furthermore, a is in so-called disjunctive normal form (abbreviated 
DNF). That is, a is a disjunction 

a = yi V • • • V Yk, 

where each y, is a conjunction 

Yi = fin A • • • A film 

and each fijj is a sentence symbol or the negation of a sentence symbol. 
(The advantages of wffs in disjunctive normal form stem from the fact 
that they explicitly list the truth assignments satisfying the formula.) 
Thus we have the consequence: 

Corollary 15C For any wff <p, we can find a tautologically equiv¬ 
alent wff a in disjunctive normal form. 

Because every function G : { F, -> { F, 7’} for n > 1 can be 
realized by a wff using only the connective symbols in {A, V, —>}, we 
say that the set {A, V, —■} is complete. (Actually the completeness is 
more a property of the Boolean functions K , A , and N that correspond to 
these symbols. But the above terminology is convenient.) Once we have 
a complete set of connectives, we know that any wff is tautologically 
equivalent to one all of whose connectives are in that set. (For given any 
wff <p, we can make a using those connectives and realizing B ip . Then 
a |==] (p.) The completeness of {A, V, -i) can be improved upon: 

Theorem 15D Both {—i. A} and {—i, V} are complete. 

Proof. We must show that any Boolean function G can be realized 
by a wff using only, in the first case, {—>, A}. We begin with a wff a 
using {—i, A, V} that realizes G. It suffices to find a tautologically 
equivalent a’ that uses only {—i. A}. For this we use De Morgan’s 
law: 


fi V y |==l - 1 (“> fi A y). 

By applying this repeatedly, we can completely eliminate V 
from a. 
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(More formally, we can prove by induction on a that there is a 
tautologically equivalent a' in which only the connectives A, — i 
occur. Two cases in the inductive step are 

Case -i: If a is (—i /8), then let a' be (-1 /3'). 

Case V: If a is (/3 V y), then let a 1 be —1 (— 1 f}' A —■ y'). Since 
/S' and y' are tautologically equivalent to /6 and y, respectively, 

a' = -1 (—1 /S' A —1 y') 

h=l -(-M-r) 

NH /ivy 

= a. 

In future proofs that a set of connectives is complete, this induction 
will be omitted. Instead we will just give, for example, the method 
of simulating V by using —1 and A. H 

Showing that a certain set of connectives is not complete is usually 
more difficult than showing that one is complete. The basic method is 
first to show (usually by induction) that for any wff a using only those 
connectives, the function B" has some peculiarity, and secondly to show 
that some Boolean function lacks that peculiarity. 

Example. {A, — *-} is not complete. 

Proof. The idea is that with these connectives, if the sentence sym¬ 
bols are assigned T, then the entire formula is assigned T. In 
particular, there is nothing tautologically equivalent to —1 A. 

In more detail, we can show by induction that for any wff a 
using only these connectives and having A as its only sentence 
symbol, we have A |= a. (In terms of functions, this says that 
B [ a (T) = T .) The same argument shows that {A, V, —>, <—►} is 
not complete, H 

For each n there are 2 2 ” n-place Boolean functions. Hence if we iden¬ 
tify a connective with its Boolean function (e.g., A with the function K 
mentioned before), we have 2 2 " n-ary connectives. We will now catalog 
these for n < 2. 

0-ary Connectives 

There are two 0-place Boolean functions, F and T. For the correspond¬ 
ing connective symbols we take _L and T. Now an n-ary connective 
symbol combines with n wffs to produce a new wff. When n = 0, we 
have that _L is a wff all by itself. It differs from the sentence symbols in 
that T(_L) = F for every v; i.e., _L is a logical symbol always assigned 
the value F. Similarly, T is a wff, and U(T) = T for every v. Then, for 
example, A —► _L is a wff, tautologically equivalent to -1 A, as can be 
seen from a two-line truth table. 
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Unary Connectives 

There are four unary connectives but only one of any interest. The 
interesting case is negation. The other three one-place Boolean functions 
are the identity function and the two constant functions. 

Binary Connectives 

There are 16 binary connectives, but only the last 10 listed in Table VI 
are “really binary.” 


TABLE VI 


Symbol 

Equivalent 

Remarks 


T 

two-place constant, essentially 0-ary 


± 

two-place constant, essentially 0-ary 


A 

projection, essentially unary 


B 

projection, essentially unary 


-.A 

negation, essentially unary 


-.B 

negation, essentially unary 

A 

A A B 

and; if F = 0 and T = 1, then this gives 
multiplication in the held {0, 1) 

V 

A VB 

or (inclusive) 

—► 

A —+ B 

conditional 


A«B 

biconditional 

4 - 

A 4- B 

reversed conditional 

+ 

(A V B) A -i (A A B) 

exclusive or, “A or B and not both”; if 

F = 0 and T = 1, then this gives the 
usual addition (modulo 2) in the 
held (0, 1} 

I 

-i (A V B) 

nor, “neither A nor B” 

1 

-i (A A B) 

nand, “not both A and B”; the symbol is 
called the Sheffer stroke 

< 

(-. A) A B 

the usual ordering, where F < T 

> 

AA(nB) 

the usual ordering, where F < T 


Ternary Connectives 

There are 256 ternary connectives; 2 are essentially 0-ary, 6 (= 2 - (j)) 
are essentially unary, and 30 (= 10 - (^)) are essentially binary. This 
leaves 218 that are really ternary. We have thus far mentioned only 
the majority connective #. There is, similarly, the minority connective. 
In Exercise 7 we encounter + 3 , ternary addition modulo 2. -ft'aft y is 
assigned the value T iff an odd number of a, ft, and y are assigned 
T. This formula is equivalent both to a -ft ft -ft y and to a <—* ft *-*■ y. 
Another ternary connective arises in Exercise 8. 
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Example. {|} and {44 are complete. 

Proof. For | 

—i a |==| a | a 

«v/s} N=l (--Of) | (-.0). 

Since {—i, V) is complete and -i, V can be simulated using only 
|, we conclude that {|} is complete. H 

Example. {—i, —»} is complete. In fact, of the 10 connectives that 
are really binary, eight have the property of forming, when added 
to -i, a complete set. The two exceptions are + and see 
Exercise 5. 

Example. {_L, —►} is complete. In fact, because with this set we can 
realize even the two 0-place Boolean functions, it is supercom- 
plete. 


Exercises 


1. Let G be the following three-place Boolean function. 


G(F, F, F) = T, 
G(F , F, T) = T, 
G(F , T, F) = T, 
G(F, T, T) = F, 


G(T, F, F ) = T, 
G(T, F, T) = F, 
G(T, T, F) = F, 
G(T , T, T) = F. 


(a) Find a wff, using at most the connectives V, A, and -i, that 
realizes G. 

(b) Then find such a wff in which connective symbols occur at not 
more than five places. 

2. Show that | and 4 are the only binary connectives that are complete 
by themselves. 

3. Show that {—i, #} is not complete. 

4. Let M be the ternary minority connective, (Thus v{Maf}y) always 
disagrees with the majority of v(a), I>(/3), and v(y).) Show the 
following: 

(a) {M, _L} is complete. 

(b) {M} is not complete. 

5. Show that {T, _L, -i ,<->,+} is not complete. Suggestion: Show that 
any wff a using these connectives and the sentence symbols A and 
B has an even number of T’s among the four possible values of 
v(a). 

Remark: Another approach uses the algebra of the field {0, 1}. 
Any Boolean function realizable with these connectives has a cer¬ 
tain linearity property. 
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6. Show that {A, <—►, +} is complete but that no proper subset is com¬ 
plete. 

7. Let + 3 be the ternary connective such that y is equivalent to 
a + P + Y ■ 

(a) Show that {T, _L, A, + 3 } is complete. 

(b) Show that no proper subset is complete. 

Remark: + 3 is the ternary parity connective; the condition for 
L(+a 1 a 2 « 3 ) — T is that uta,) = T for an odd number of i’s. 
+ is the binary parity connective. The function G in the proof of 
Theorem 15B is the 3-place parity function. 

8. Let I be the ternary connective such that la ft y is assigned the value 
T iff exactly one of the formulas a, f, y is assigned the value T. 
Show that there are no binary connectives o and A such that I a fly 
is equivalent to (a o /l)Ay. 

9. Say that a formula a is in conjunctive normal form (abbreviated 
CNF) iff it is a conjunction 

a = y x A • • • A y k 

where each y,- is a disjunction 

Vi = Al V ■ ■ ■ V fin 

and each /3 ( ; is either a sentence symbol or the negation of a sentence 
symbol. 

(a) Find a formula in conjunctive normal form that is tautologically 
equivalent to A <—* B <-» C. 

(b) Show that for any formula, we can find a tautologically equiv¬ 
alent formula in conjunctive normal form. 

10. Add the 0-place connectives T, _L to our language. For each wff 
ip and sentence symbol A, let tp A be the wff obtained from <p by 
replacing A by T. Similarly for <pj_. Then let <p£ = (ip A V <p±). 
Prove the following: 

(a) cp |= <p£. 

(b) If cp |= i/f and A does not appear in i jr, then |= f. 

(c) The formula ip is satisfiable iff <pjj is satisfiable. 

Remarks: We can think of <pjj as trying to say everything ip says, 
but without being able to use the symbol A. Parts (a) and (b) state 
that <p A t is the strongest A -free consequence of ip. The formulas <p 
and <p A are not tautologically equivalent in general, but they are 
“equally satisfiable” by part (c). The operation of forming <p A from 
cp is (in another context) called resolution on A. 

11. (Interpolation theorem) If a |= f , then there is some y all of whose 
sentence symbols occur both in a and in f and such that a |= y |= 
f. Suggestion: Use the preceding exercise. 
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Remarks: There is an analogue of Exercise 11 that holds for first- 
order logic. But the proof in that case is very different, because there 
is no analogue of Exercise 10. 

12. Is the set {A, T, _L} complete? Support your answer. 


SECTION 1.6 _ 

Switching Circuits 1 

Consider an electrical device (traditionally a black box) having n inputs 
and one output (Fig. 1). Assume that to each input we apply a signal 
having one of two values and that the output has one of two values. The 
two possible values we call F and T. (We could also define the F value 
as 0 potential and choose the unit of potential so that the T value has 
potential 1.) Further assume that the device has no memory; i.e., the 
present output level depends only on the present inputs (and not on past 
history). Then the performance of the device is described by a Boolean 
function: 

G(Xi,..., X„) — the output level given the input signalsXi,..., X„. 


A, 

X 2 

*3 


F(X t/ X 2 ,X 3 ) 


Figure 1 . Electrical device with three inputs. 


Devices meeting all these assumptions constitute an essential part of 
digital-computer circuitry. There is, for example, the two-input AND 
gate, for which the output is the minimum of the inputs (where F < T ). 
This device realizes the Boolean function K of the preceding section. 
It is convenient to attach the labels Ai and A 2 to the inputs and to label 
the output A! A A 2 . 

Similar devices can be made for other sentential connectives. For a 
two-input OR gate (Fig. 2) the output voltage is the maximum of the in¬ 
put voltages. Corresponding to the negation connective there is the NOT 
device (or inverter), whose output voltage is the opposite of the input 
voltage. 

A circuit can be constructed from various devices of this sort. And it 
is again natural to use wffs of our formal language to label the voltages 


1 This section, which discusses an application of the ideas of previous sections, may be 
omitted without loss of continuity. 
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OR 


Figure 2. OR gate. 

at different points (Fig. 3). Conversely, given the wff thus attached to 
the output, we can approximately reconstruct the circuit, which looks 
very much like the tree of the wff’s formation. 



Figure 3. Circuit with wffs as labels. 


For example, the circuit for 

((A A B) A D) V ((A A B) A -i C) 

would probably be as shown in Fig. 4. Duplication of the circuit for 
A A B would not usually be desirable. 



Figure 4. Circuit for ((A A B) A D) V ((A A B) A — iC). 


Tautologically equivalent wffs yield circuits having ultimately the 











56 


A Mathematical Introduction to Logic 


same performance, although possibly at different cost and (if the devices 
are not quite instantaneous in operation) different speed. Define the 
delay (also called the depth) of a circuit as the maximum number of 
boxes through which the signal can pass in going from an input to the 
output. The corresponding notion for formulas is conveniently defined 
by recursion. 

1. The delay of a sentence symbol is 0. 

2. The delay of —i a is one greater than the delay of a. 

3. The delay of a A/f is one greater than the maximum of the delay 
of a and the delay of ft. 

And similarly for any other connective. 

For example, the circuit of (Ai A A 2 ) V —1 A 3 uses three devices and 
has a delay of 2. The tautologically equivalent formula —1 (A 3 A (—1 A! V 
— 1 A 2 )) gives a circuit having five devices and a delay of 4. The problem 
facing many a computer engineer can be stated: Given a circuit (or its 
wff), find an equivalent circuit (or a tautologically equivalent wff) for 
which the cost is a minimum, subject to constraints such as a maximum 
allowable delay. For this problem there is some catalog of available 
devices; for example, she might have available 

NOT, two-input AND, three-input OR. 

(It is clearly desirable that the available devices correspond to a com¬ 
plete set of connectives.) The catalog of devices determines a formal 
language, having a connective symbol for each device. 


Example 1 . Inputs: A, B, C. Output: To agree with the majority of 
A, B, and C. Devices available: two-input OR, two-input AND. 
One solution is 

((A A B) V (A A C)) V (B A C), 

which uses five devices and has a delay of 3. But a better solution 
is 


(A A (B V C)) V (B A C), 

which uses four devices and has the same delay. Furthermore, 
there is no solution using only three devices; cf. Exercise 1. 

Example 2. Inputs: A and B. Output: T if the inputs agree, F oth¬ 
erwise; i.e., the circuit is to test for equality. Device available: 
two-input NOR. One solution is 

((A 4. A) ; B) ; ((B 4 B) 4. A). 

This uses five devices; is there a better solution? And there is 
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the deeper question: Is there an efficient procedure for finding a 
minimal solution? These are questions that we merely raise here. 
In recent years a great deal of work has gone into investigating 
questions of this type. 

Example 3. (Relay switching) Inputs: A, -i A, B, —i B, ... Devices: 
OR (any number of inputs), AND (any number of inputs). Cost: 
Devices are free, but each use of an input costs one unit. To test 
for equality of A and B we could use 

(A A B) V (-■ A A -i B). 

The wiring diagram for the circuit is shown in Fig. 5. The circuit 
will pass current iff A and B are assigned the same value. (This 
formula, equivalent to A <—► B, has the property that its truth value 
changes whenever the value of one argument changes. For this 
reason, the circuit is used, with double-throw switches, in wiring 
hallway lights.) 



But there is one respect in which relay circuits do not fit the 
description given at the beginning of this section. Relays are bilat¬ 
eral devices; they will pass current in either direction. This feature 
makes “bridge” circuits possible (Fig. 6). The methods described 
here do not apply to such circuits. 



Figure 6. Bridge circuit. 
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Example 4. There are four inputs, and the circuit is to realize the 
Boolean function G, where G is to have the value 7 at (7, 7, F, 
T), (F, F, T, F), ( F , F, T, T }, (F, T, F, F), {F, T, F, T), (F, T, 
T, F), {F , T, T, T), and ( T , F, F, T). G is to have the value 
F at {T, F, F, F), {T, F, T, F), (T, T, F, F), {T, T, T, F), and 
( T , 7, 7, T). At the remaining three points, ( F , F, F , F), ( T , F, 
T , T), and ( T , 7, F, T), we don’t care about the value of G. (The 
application of the circuit is such that these three combinations 
never occur.) 

We know that G can be realized by using, say, { A , V, -i}. But 
we want to do this in an efficient way. The first step is to repre¬ 
sent the data in a more comprehensible form. We can do this by 
means of Fig. 7. Since G(7, 7, 7, 7) = 7, we have placed a 7 in 
the square with coordinates (—i A, —> B, —i C, D). Similarly, there 
is an 7 in the square with coordinates (A, B, -i C, -i D) because 
G(7, 7, 7, 7) = 7. The three squares we do not care about are 
left blank. 



-«->A-*-| 

r*- a -*-j 


c 


T 

F 

F 

1 

-.D 

T 

T 


T 

D 

c 

T 

T 

F 


T 
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F 

F 

i 

-D 

A 


^_„ „ 

(-«-B-*- 

< - > 



Figure 7. Diagram for Example 4. 


Now we look for a simple geometrical pattern. The shaded area 
includes all 7’s and no 7’s. It corresponds to the formula 

(-iA)V(-iCA D), 

which is reasonably simple and meets all our requirements. Note 
that the input B is not needed at all. 
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Exercises 

1. In Example 1 of this section, verify that there is no solution using 
only three devices. 

2. Define a literal to be a wff which is either a sentence symbol or the 
negation of a sentence symbol. An implicant of tp is a conjunction 
a of literals (using distinct sentence symbols) such that a \= <p. We 
showed in Section 1.5 (cf. Corollary 15C) that any satisfiable wff ip 
is tautologically equivalent to a disjunction oq V • • • Va„ where each 
aj is an implicant of ip. An implicant a of tp is prime iff it ceases to be 
an implicant upon the deletion of any of its literals. Any disjunction 
of implicants equivalent to <p clearly must, if it is to be of minimum 
length, consist only of prime implicants. 

(a) Find all prime implicants of 

(A —» B) A (-i A —*■ C). 

(b) Which disjunctions of prime implicants enjoy the property of 
being tautologically equivalent to the formula in part (a)? 

3. Repeat (a) and (b) of Exercise 2, but for the formula 

(A V -i B) A (C V D) —> B A ((A A C) V (C A D)). 

SECTION 1.7 _ 

Compactness and Effectiveness 

Compactness 

We now give a proof of the compactness theorem mentioned earlier 
(Section 1.2). Call a set E of wffs satisfiable iff there is a truth assign¬ 
ment that satisfies every member of E. 

Compactness Theorem A set of wffs is satisfiable iff every finite 
subset is satisfiable. 

Let us temporarily say that E is finitely satisfiable iff every finite 
subset of E is satisfiable. Then the compactness theorem asserts that 
this notion coincides with satisfiability. Notice that if E is satisfiable, 
then automatically it is finitely satisfiable. Also if E is finite, then the 
converse is trivial. (Every set is a subset of itself.) The nontrivial part is 
to show that if an infinite set is finitely satisfiable, then it is satisfiable. 

Proof of the Compactness Theorem. The proof consists of two dis¬ 
tinct parts. In the first part we take our given finitely satisfiable 
set E and extend it to a maximal such set A. In the second part 
we utilize A to make a truth assignment that satisfies E. 
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For the first part let ai, (* 2 ,... be a fixed enumeration of the 
wffs. (This is possible since the set of sentence symbols, and hence 
the set of expressions, is countable; see Theorem OB.) Define by 
recursion (on the natural numbers) 

A 0 = E, 

f A„; cr„ + i if this is finitely satisfiable, 

A„+i = < 

[ A„; -ia!„ + i otherwise. 

(Recall that A„; a„ +1 = A„ U {a„ +1 }.) Then each A„ is finitely 
satisfiable; see Exercise 1. Let A = (J n A„, the limit of the A„’s. 

It is clear that (1) E C A and that (2) for any wff a either 
a e A or (—1 or) e A. Furthermore, (3) A is finitely satisfiable. 
For any finite subset is already a finite subset of some A„ and 
hence is satisfiable. 

This concludes the first part of the proof; we now have a set 
A having properties (l)-(3). There is in general not a unique 
such set, but there is at least one. (An alternative proof of the 
existence of such a A — and one that we can use even if there are 
uncountably many sentence symbols — employs Zorn’s lemma. 
The reader familiar with uses of Zorn’s lemma should perceive 
its applicability here.) 

For the second part of the proof we define a truth assignment 
v for the set of all sentence symbols: 

v(A) = T iff A e A 

for any sentence symbol A. Then for any wff cp, we claim that 
v satisfies <p iff ip e A. 

This is proved by induction on (p; see Exercise 2. Since E C A, 
v must then satisfy every member of E. H 

Corollary 17A If E |= r, then there is a finite E ( , C E such that 
Eo h T. 

Proof. We use the basic fact that E |= r iff E ; —< r is unsatisfiable, 

Eo t for every finite Eo c E 

=>■ Eo; —1 r is satisfiable for every finite Eo c E 
=>■ E; —1 x is finitely satisfiable 
=>■ E; —1 r is satisfiable 
=► E ^ r. 

H 

In fact, the above corollary is equivalent to the compactness theorem; 
see Exercise 3. 



Chapter 1: Sentential Logic 


61 


Effectiveness and Computability 

Although the method of truth tables is rather cumbersome to use, the 
existence of the method yields interesting theoretical conclusions. Sup¬ 
pose we ask of a set E of wffs whether or not there is an effective 
procedure that, given a wff r, will decide whether or not E |= r. By an 
effective procedure we mean one meeting the following conditions: 

1. There must be exact instructions (i.e., a program) explaining how 
to execute the procedure. The instructions must be finite in length. After 
all, it must be possible to give the instructions to the person or machine 
doing the calculations, and one cannot give someone all of an infinite 
object. But we impose no upper limit in advance on exactly how long 
the instructions can be. If there are more lines in the instructions than 
there are electrons in the universe, we merely shrug and say, "That’s a 
pretty long program.” 

2. These instructions must not demand any brilliance or originality 
on the part of the person or machine following them. The idea is that a 
diligent clerk (who knows no mathematics but is very good at following 
directions) or your computing machine (which does not think at all) 
should be able to execute the instructions. That is, it must be possible 
for the instructions to be mechanically implemented. The procedure 
must avoid random devices (such as the flipping of a coin), or any such 
device that can, in practice, only be approximated. 

3. In the case of a decision procedure, as asked for above, the pro¬ 
cedure must be such that, given a wff r, after a finite number of steps 
the procedure produces a “yes” or “no” answer. (That is, the procedure 
must be an algorithm for determining the answer.) 

On the other hand, we place no bound in advance on the amount of 
time that might be required before the answer appears. Nor do we place 
any advance bound on the amount of scratch paper (or other storage 
medium) that might be required. These will depend on, among other 
things, the input r. But for any one r, the procedure is to require only a 
finite number of steps to produce the answer, and so only a finite amount 
of scratch paper will be consumed. It is no fair doing an infinite number 
of steps and then giving the answer. 

People with a digital computing machine may regard a procedure as 
being effective only when it can be carried out on their machine in a 
“reasonable” length of time. Of course, the matter of what is reasonable 
may change with the circumstances. Next year they plan to get a faster 
machine with more memory. At that time, their idea of what can be done 
in a reasonable length of time will be considerably extended. We want 
here a concept of effective procedure that is the limiting case where 
all the practical restrictions on running time and memory space are 
removed. 
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Of course the foregoing description can hardly be considered a pre¬ 
cise definition of the word “effective.” And, in fact, that word will be used 
only in an informal intuitive way throughout this book. (In Chapter 3 
we will meet a precise counterpart, “recursive.”) But as long as we 
restrict ourselves to positive assertions that there does exist an effective 
procedure of a certain sort, the informal approach suffices. We simply 
display the procedure, show that it works, and people will agree that it 
is effective. (But this relies on the empirical fact that procedures which 
appear effective to one mathematician also appear so to others.) If we 
wanted a negative result, that there did not exist an effective procedure 
of a certain sort, then this informal viewpoint would be inadequate. (In 
Chapter 3 we do want to obtain just such negative results.) Because the 
notion of effectiveness is informal, definitions and theorems involving 
it will be marked with a star. For example: 

*Theorem 17B There is an effective procedure that, given any ex¬ 
pression e, will decide whether or not it is a wff. 

Proof. See the algorithm in Section 1.3 and the footnotes thereto. 

H 


There is one technical point here, stemming from the fact that our lan¬ 
guage has infinitely many different sentence symbols. When we speak 
of being “given” an expression s, we imagine that someone might write 
down the symbols in e, one after the other. It is implausible that anyone 
has the potential of writing down any one of an infinitude of symbols. 
To avoid this, we adopt the following “input/output format”: Instead of 
writing down A 5 , for example, we use A"", a string of five symbols. 
Now the total number of symbols in our alphabet is only nine: 

(, ), - 1 , A, V, —►, A, and '. 

(If we identify these nine symbols with the digits 1-9, we get expressions 
that look particularly familiar in computational settings! And we still 
have the 0 digit for separating expressions.) 

Theorem 17B states that the set of wffs is decidable in the sense of 
the following definition: 

^Definition. A set E of expressions is decidable iff there exists 
an effective procedure that, given an expression », will decide 
whether or not a e E. 

For example, any finite set is decidable. (The instructions can simply 
list the finitely many members of the set. Then the algorithm can check 
the input against the list.) Some infinite sets are decidable but not all. 
On the one hand, there are uncountably many (2 K °, to be exact) sets 
of expressions. On the other hand, there can be only countably many 
effective procedures. This is because the procedure is completely deter- 
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mined by its (finite) instructions. There are only K 0 finite sequences of 
letters, and the instructions, when written out, form a finite sequence of 
letters. 

*Theorem 17C There is an effective procedure that, given a finite 
set E; r of wffs, will decide whether or not E |= r. 

Proof. The truth-table procedure (Section 1.2) meets the require¬ 
ment. H 

In this theorem we specified that E; r was finite, since one cannot 
be “given” in any direct and effective way all of an infinite object. 

*Corollary 17D For a finite set E, the set of tautological conse¬ 
quences of E is decidable. In particular, the set of tautologies is 
decidable. 

If E is an infinite set — even a decidable one — then in general its set 
of tautological consequences may not be decidable. (See Chapter 3.) But 
we can obtain a weaker result, which is in a sense half of decidability. 

Say that a set A of expressions is effectively enumerable iff there 
exists an effective procedure that lists, in some order, the members of 
A. If A is infinite, then the procedure can never finish. But for any 
specified member of A, it must eventually (i.e., in a finite length of 
time) appear on the list. 

To give more of a feeling for this concept, we now state an equivalent 
way of formulating it. 

*Theorem 17E A set A of expressions is effectively enumerable iff 
there exists an effective procedure that, given any expression s, 
produces the answer “yes” iff e e A. 

If e ^ A, the procedure might produce the answer “no”; more likely 
it will go on forever without producing any answer, but it must not 
lie to us and produce the answer “yes.” Such a procedure is called a 
semidecision procedure — it is half of a decision procedure: 

’‘'Definition. A set A of expressions is semidecidable iff there exists 
an effective procedure that, given any expression e, produces the 
answer “yes” iff e e A. 

Thus Theorem 17E states that a set is effectively enumerable iff it is 
semidecidable. 

Proof. If A is effectively enumerable, then given any e we can 
examine the listing of A as our procedure churns it out. If and 
when e appears, we say “yes.” (Thus if e ^ A, no answer is ever 
given. It is this that keeps A from being decidable. When s has 
failed to occur among the first 10 10 enumerated members of A, 
there is in general no way of knowing whether s A — in which 
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case one should give up looking — or whether s will occur in the 
very next step.) 

Conversely, suppose that we have the procedure described 
in the theorem. We want to create a listing of A. The idea is 
to enumerate all expressions, and to apply our given procedure to 
each. But we must budget our time sensibly. It is easy enough to 
enumerate effectively all expressions: 

£ 1, £2, £3, ••• • 

Then proceed according to the following scheme: 

1. Spend one minute testing £ 1 , for membership in A (using 
the given procedure). 

2. Spend two minutes testing £ 1 , then two minutes testing £ 2 . 

3. Spend three minutes testing £ 1 , three minutes testing £ 2 , and 
three minutes testing £ 3 . 

And so forth. Of course whenever our procedure produces a 
“yes” answer, we put the accepted expression on the output list. 
Thus any member of A will eventually appear on the list. (It 
will appear infinitely many times, unless we modify the above 
instructions to check for duplication.) H 

Clearly any decidable set is also semidecidable. (We get a semide¬ 
cision procedure even if the “no” bulb has burned out.) Hence any any 
decidable set is effectively enumerable. 

*Theorem 17F A set of expressions is decidable iff both it and its 
complement (relative to the set of all expressions) are effectively 
enumerable. 

Proof. Exercise 8. This result is sometimes called “Kleene’s 
theorem.” H 

Observe that if sets A and B are effectively enumerable, then so are 
AUfi and AD B (Exercise 11). The class of decidable sets is also 
closed under union and intersection, and it is in addition closed under 
complementation. 

Now for a more substantive result: 

*Theorem 17G If £ is a decidable set of wffs, then the set of tau¬ 
tological consequences of E is effectively enumerable. 

Proof. Actually it is enough for £ to be effectively enumerated; 
consider an enumeration 


0 i, 02, 03, • • • • 
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Given any wff r, we can test (by truth tables) successively whether 
or not 

0 \= r, 

{o-l} N r, 

{cxi, cr 2 } |= r, 

{<Ti, ct 2 , er 3 } |= r, 

and so forth. If any of these conditions is met, then we answer 
“yes.” Otherwise, we keep trying. 

This does produce an affirmative answer whenever E |= r, by 
the corollary to the compactness theorem. H 

Later on, we will want to use effective procedures to compute func¬ 
tions. We will say that a function / is effectively computable (or simply 
computable ) iff there exists an effective procedure that, given an input 
x, will eventually produce the correct output f(x). 


Exercises 


1. Assume that every finite subset of E is satisfiable. Show that the 
same is true of at least one of the sets E; a and E; -m. (This is 
part of the proof of the compactness theorem.) Suggestion: If not, 
then Ei; a and E 2 ; —>a are unsatisfiable for some finite Ei C E 
and E2 c E. Look at Ej U E2. 

2. Let A be a set of wffs such that (i) every finite subset of A is 
satisfiable, and (ii) for every wff a, either a e A or (—>a) e A. 
Define the truth assignment v: 


v(A) = 


T iff AeA, 
F iff A i A 


for each sentence symbol A. Show that for every wff (p, v(<p) = T 
iff (p e A. (This is part of the proof of the compactness theorem.) 
Suggestion: Use induction on <p. 

3. Show that from the corollary to the compactness theorem we can 
prove the compactness theorem itself (far more easily than we can 
starting from scratch). 

4. In 1977 it was proved that every planar map can be colored with 
four colors. Of course, the definition of “map” requires that there 
be only finitely many countries. But extending the concept, sup¬ 
pose we have an infinite (but countable) planar map with countries 
Ci, C 2 , C 3 ,.... Prove that this infinite planar map can still be col¬ 
ored with four colors. (Suggestion: Partition the sentence symbols 
into four parts. One sentence symbol, for example, can be used to 
translate, “Country C 7 is colored red.” Form a set Ei of wffs that 
say, for example, C 7 is exactly one of the colors. Form another set 



66 


A Mathematical Introduction to Logic 


£2 of wffs that say, for each pair of adjacent countries, that they 
are not the same color. Apply compactness to £1 U £2-) 

5 . Where £ is a set of wffs, define a deduction from £ to be a finite 
sequence (ao, •••,«„} of wffs such that for each k < n, either (a) 
ai : is a tautology, (b) a* e £, or (c) for some i and j less than k, a, 
is (oij —> oik). (In case (c), one says that a a- is obtained by modus 
ponens from a,- and a,.) Give a deduction from the set 

{ 1S V R, R —y P, S} 

the last component of which is P. 

6. Let (ao, •••,«„} be a deduction from a set £ of wffs, as in the 
preceding problem. Show that £ |= 07,. for each k < n. Suggestion: 
Use strong induction on k, so that the inductive hypothesis is that 
£ |= o/j for all i < k. 

7 . Show that whenever £ |= t, then there exists a deduction from 
£, the last component of which is r. Remark: This result is called 
“completeness”; the concepts in Exercises 5-7 will reappear in 
Section 2 . 4 . 

8. Prove Theorem 17 F. Remark: Two semidecision procedures make 
a whole. 

* 9 . The concepts of decidability and effective enumerability can be 
applied not only to sets of expressions but also to sets of integers 
or to sets of pairs of expressions or integers. Show that a set A of 
expressions is effectively enumerable iff there is a decidable set B 
of pairs (a, n) (consisting of an expression a and an integer n ) such 
that A = dom B. 

10 . Let £ be an effectively enumerable set of wffs. Assume that for each 
wff r, either £ ^ror£ |= —it. Show that the set of tautological 
consequences of £ is decidable, 

(a) Do this where “or” is interpreted in the exclusive sense: either 
£ ^ror£ |= -1 r but not both. 

(b) Do this where “or” is interpreted in the inclusive sense: either 
£ ^ror£ |= —. r or both. 

11 . (a) Explain why the union of two effectively enumerable sets is 

again effectively enumerable. 

(b) Explain why the intersection of two effectively enumerable sets 
is again effectively enumerable. 

12 . For each of the following conditions, give an example of an unsat- 
isfiable set T of formulas that meets the condition. 

(a) Each member of T is — by itself—satisfiable. 

(b) For any two members y\ and 5/2 of T, the set {71, 72} is satisfi¬ 
able. 

(c) For any three members y \, 72, and 73 of T, the set {71, 72, 73} 
is satisfiable. 





w o 







First-Order Logic 





? i- 



SECTION 2.0_ 

Preliminary Remarks 

The preceding chapter presented the first of our 
mathematical models of deductive thought. It was 
a simple model, indeed too simple. It is easy to 
think of examples of intuitively correct deductions 
that cannot be adequately mirrored in the model of 
sentential logic. 

Suppose we begin with a collection of hypothe¬ 
ses (in English) and a possible conclusion. By trans¬ 
lating everything to the language of sentential logic 
we obtain a set E of hypotheses and a possible con¬ 
clusion r. Now if E |= r, then we feel that the orig¬ 
inal English-language deduction was valid. But if 
E r, then we are unsure. It may well be that the 
model of sentential logic was simply too crude to 
mirror the subtlety of the original deduction. 

This chapter presents a system of logic of much 
greater ability. In fact, when the “working mathe¬ 
matician” finds a proof, almost invariably what is 
meant is a proof that can be mirrored in the system 
of this chapter. 

First, we want to give an informal description of 
the features our first-order languages might have 
(or at least might be able to simulate). We begin 
with a special case, the first-order language for 
number theory. For this language there is a certain 
intended way of translating to and from English 
(Table VII). 
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TABLE Vll 


Formal expression 

Intended translation 

0 

“Zero.” Here 0 is a constant symbol, intended to 
name the number 0. 

St 

"The successor of t.” Here S is a one-place function 
symbol, t is to be an expression that names some 
number a. Then St names 5(a). the successor of a. 
For example, SO is intended to name the number 1. 

<ViV 2 

“v l is less than u 2 .” Here < is a two-place predicate 
symbol. At the end of Section 2.1 we will adopt 
conventions letting us abbreviate the expression in 
the more usual style: Vi < Vi. 

V 

“For every natural number.” The symbol V is the 
universal quantifier symbol. More generally, with 
each translation of the language into English there 
will be associated a certain set A (the so-called uni¬ 
verse); V will then become “for every member of 
the universe A.” 

V v { < Oui 

“For every natural number V\, zero is less than V\ 
Or more euphoniously, “Every natural number is 
larger than 0.” This formal sentence is false in the 
intended translation, since zero is not larger than 
itself. 


One abbreviation is mentioned in Table VII. There will be more 
(Table VIII). 

Actually we will not be quite as generous as the tables might suggest. 
There are two economy measures that we can take to obtain simplifica¬ 
tion without any essential loss of linguistic expressiveness: 

First, we choose as our sentential connective symbols just -i and —►. 
We know from Section 1.5 that these form a complete set, so there is no 
compelling reason to use more. 

Secondly, we forego the luxury of an existential quantifier, 3 x. In its 
place we use —i Vx —i. This is justified, since an English sentence like 

There is something rotten in the state of Denmark, 

is equivalent to 

It is not the case that for every x, x is not rotten in the state 
of Denmark. 

Thus the formula 3 v\ V v 2 v i = Vi becomes, in unabbreviated form, 
(-i V -Ui(-i V» 2 = V\ v 2 )). 
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TABLE VIII 

Abbreviated expression 

Intended translation 

x = y 

“x equals y.” In unabbreviated form this will 
become =xy. 

3 v 

"There exists a natural number v such that.” Or 
more generally, “there exists a member of the 
universe such that.” 

3 Vi V v 2 Vi — v 2 

“There is exactly one natural number.” Again 
this formal sentence is false in the intended 

translation. 

V wTO < Vi V 0 = v t ) 

“Every natural number is greater than or equal 
to zero.” 


For an example in an ad hoc language, we might translate “Socrates 
is a man” as Hs, where H is a one-place predicate symbol intended 
to translate “is a man” and s is a constant symbol intended to name 
Socrates. Similarly, to translate “Socrates is mortal” we take Ms. Then 
“All men are mortal” is translated as V V\(H v t —<- Mv\). 

The reader will possibly recognize the symbols V and 3 from previ¬ 
ous mathematical contexts. Indeed, some mathematicians, when writing 
on the blackboard during their lectures, already use a nearly formalized 
language with only vestigial traces of English. That our first-order lan¬ 
guages resemble theirs is no accident. We want to be able to take one 
step back and study not, say, sets or groups, but the sentences of set 
theory or group theory. (The term metamathematics is sometimes used; 
the word itself suggests the procedure of stepping back and examining 
what the mathematician is doing.) The objects you, the logician, now 
study are the sentences that you, the set theoretician, previously used in 
the study of sets. This requires formalizing the language of set theory. 
And we want our formal languages to incorporate the features used in, 
for example, set theory. 


SECTION 2.1 _ 

First-Order Languages 

We assume henceforth that we have been given infinitely many distinct 
objects (which we call symbols), arranged as follows: 

A. Logical symbols 
0. Parentheses: (,). 

1. Sentential connective symbols: —>, -i. 
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2. Variables (one for each positive integer n ): 

V\, v 2 , .... 

3. Equality symbol (optional): =. 

B. Parameters 

0. Quantifier symbol: V. 

1. Predicate symbols: For each positive integer n, some set (pos¬ 
sibly empty) of symbols, called n-place predicate symbols. 

2. Constant symbols: Some set (possibly empty) of symbols. 

3. Function symbols: For each positive integer n , some set (pos¬ 
sibly empty) of symbols, called n-place function symbols. 

In A.3 we allow for the possibility of the equality symbol’s being 
present, but we do not assume its presence. Some languages will have 
it and others will not. The equality symbol is a two-place predicate 
symbol but is distinguished from the other two-place predicate symbols 
by being a logical symbol rather than a parameter. (This status will affect 
its behavior under translations into English.) We do need to assume that 
some n-place predicate symbol is present for some n. 

In B.2, the constant symbols are also called 0-place function symbols. 
This will often allow a uniform treatment of the symbols in B.2 and B.3. 

As before, we assume that the symbols are distinct and that no symbol 
is a finite sequence of other symbols. 

In order to specify which language we have before us (as distinct 
from other first-order languages), we must (i) say whether or not the 
equality symbol is present, and (ii) say what the parameters are. 

We now list some examples of what this language might be: 

1. Pure predicate language 

Equality: No. 

n-place predicate symbols: A", A'j,.... 

Constant symbols: a\,a 2 ,... ■ 

n-place function symbols (n > 0): None, 

2. Language of set theory 

Equality: Yes (usually). 

Predicate parameters: One two-place predicate symbol e. 

Function symbols: None (or occasionally a constant symbol 0). 

3. Language of elementary number theory (as in Chapter 3) 

Equality: Yes. 

Predicate parameters: One two-place predicate symbol <. 

Constant symbols: The symbol 0. 
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One-place function symbols: S (for successor). 

Two-place function symbols: + (for addition), • (for multiplication), 
and E (for exponentiation). 

In examples 2 and 3 there are certain intended translations of the 
parameters. We will presently give a number of examples of sentences 
that can be translated into these languages and a few examples of sen¬ 
tences that cannot be so translated. 

It is important to notice that our notion of language includes the 
language for set theory. For it is generally agreed that, by and large, 
mathematics can be embedded into set theory. By this is meant that 

(a) Statements in mathematics (such as the fundamental theorem of 
calculus) can be expressed in the language of set theory; and 

(b) The theorems of mathematics follow logically from the axioms 
of set theory. 

Our model of first-order logic is fully adequate to mirror this procedure. 

Examples in the language of set theory. Here it is intended that V 
should mean “for all sets” and e should mean “is a member of.” 

1 . “There is no set of which every set is a member.” We will 
translate this into the language of set theory using several steps. 
The intermediate sentences are neither in English nor in the formal 
language but are in a mixed language. 

—■[There is a set of which every set is a member] 

-i 3 v\ [Every set is a member of v\ ] 

—i 3 Vj V Lb 1>2 € V\ 

Although it is tempting to stop here, we must now replace ti 2 e v\ 
by ev2V\ , since predicate symbols will always go at the left in 
such contexts. Furthermore, 3 v\ must be replaced by —i V v\ -i, 
as mentioned earlier. And we must use the correct number of 
parentheses. The finished product is 

en 2 ^i))). 

2. Pair-set axiom: “For any two sets, there is a set whose mem¬ 
bers are exactly the two given sets.” Again we carry out the trans¬ 
lation in stages. 

V V\ V V2 [There is a set whose members are exactly v\ and 
v 2 ] 

V V\ V V2 3 n 3 [The members of v 3 are exactly 14 and v 2 ] 

V V\ V V2 3 11 3 V 14(14 € t> 3 Ot)4=Di Vt)4= V2 ) 

Now we replace 3 14 by -1V 1)3 —1,1)4 e 113 by £14113, and 14 = ti; 
by =11411, . In addition, we must eliminate o- and A in favor of 
our chosen connectives —>• and -1. Thus 

a V f3 becomes —1 a —>• ji\ 
a' o fi becomes -1 ((a —► / 3 ) —► —1 (/l —1«)). 
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The finished product is 

V V V2(—I V «3(—IV tty(“i ((&V4V3 —i► (( -1 = Vi) —► 

= V4U2)) — >• (-> (((— ' =v 4 vi) —► —V4V2) —* ev 4 « 3 ))))))- 

The finished product is not nearly as pleasant to read as the ver¬ 
sion that preceded it. As we have no interest in deliberately making 
life unpleasant for ourselves, we will eventually adopt conventions 
allowing us to avoid seeing the finished product at all. But for the 
moment it should be regarded as an interesting, even if unattractive, 
novelty. 

Examples in the language of elementary number theory. Here it is 
intended that V should mean “for all natural numbers” and that 
<, 0, S, +, •, and E should have the obvious meanings. 

1. As a name for the natural number 2 we have the term SSO, 
since 2 is the successor of the successor of zero. Similarly, for 4 
we have the term SSSSO. For the phrase “2 + 2” it is tempting to 
use SSO + SSO. But we will adopt the policy of always putting 
the function symbol at the left (i.e., we will use Polish notation 
for function symbols). Thus corresponding to the English phrase 
“2 + 2” we have the term + SSO SSO. The English sentence “Two 
plus two is four” is translated as 

= + SSOSSOSSSSO. 

(The spaces are inserted to help the reader, but they do not con¬ 
stitute an official feature of the language.) 

2. “Any nonzero natural number is the successor of some num¬ 
ber.” We will perform the translation in three steps: 

V V\ [If V\ is nonzero, then V\ is the successor of some number.] 

V Vi (vi ^ 0 —► 3 v 2 v\ — Sv 2 ). 

V vi ((-1 =1^0) —► (- 1 V v 2 (—i =v\Sv 2 ))). 

3. “Any nonempty set of natural numbers has a least element.” 
This cannot be translated into our language, because we cannot 
express “any... set.” This requires either something like the (first- 
order) language for set theory or a second-order language for 
number theory. We could, however, translate, “The set of primes 
has a least element.” (The first step is to convert this sentence into, 
“There is a smallest prime.” We leave the other steps to the reader; 
hints can be found in the next section.) 

Examples in ad hoc languages 

1. “All apples are bad.” 


V v\{Av\ —* Bv 1 ). 
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2. “Some apples are bad.” 

Intermediate step: 3 V\(Av\ A Bv\). 

Finished product: (—i V V\ (—i (-i ( Avi —> (—i B «!))))). 

These two examples illustrate patterns that arise continually. 
An English sentence which asserts that everything in a certain 
category has some property is translated 

V v( —> ). 

A sentence which asserts that there is some object or objects in 
the category and having the property is translated 

3 v( A ). 

The reader should be cautioned against confusing the two pat¬ 
terns. For example, 


V Vi(Av t A Bv )) 

translates “Everything is an apple and is bad,” which is a much 
stronger assertion than the sentence in the first example. Similarly, 
3 v\(Av\ —y Bv i) translates “There is something which is bad, if 
it is an apple.” This is a much weaker assertion than the sentence 
in the second example. It is true (vacuously) even if all apples are 
good, provided only that the world has something which is not an 
apple. 

3. “Bobby’s father can beat up the father of any other kid on 
the block.” Establish a language in which V is intended to mean 
“for all people,” Kx is to mean “x is a kid on the block,” b is to 
mean “Bobby,” Bxy is to mean “x can beat up y” and fx is to 
mean “the father of x.” Then a translation is 

V vi(Kvi —> ((-i —vib) —> Bfbfvi)). 

4 . In calculus, we learn the meaning of "the function / con¬ 
verges to L as x approaches a”: 

Ve(e > 0 —> 3 8(8 > 0 A Vx(|x — a\ < 8 —> \ fx — L\ < e))). 

This is, apart from notational matters, a formula of the sort we 
are interested in, using a predicate symbol for ordering, function 
symbols for /, subtraction, and absolute values, and constant 
symbols for 0, a, and L. 

Formulas 

An expression is any finite sequence of symbols. Of course most ex¬ 
pressions are nonsensical, but there are certain interesting expressions: 
the terms and the wffs. 
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The terms are the nouns and pronouns of our language; they are the 
expressions that can be interpreted as naming an object. The atomic 
formulas will be those wffs having neither connective nor quantifier 
symbols. 


terms 


atomic 

formulas 


other all other 

wffs expressions 


wffs 

The terms are defined to be those expressions that can be built up 
from the constant symbols and the variables by prefixing the function 
symbols. To restate this in the terminology of Chapter 1, we define for 
each n-place function symbol /, an /i-place term-building operation Tf 
on expressions: 

.A/ (C | . .... 8 n ) f 8 1 ••*£;(. 


Definition. The set of terms is the set of expressions that can be 
built up from the constant symbols and variables by applying (zero 
or more times) the T / operations. 

If there are no function symbols (apart from the constant symbols), 
then the terms are just the constant symbols and the variables. In this 
case we do not need an inductive definition. 

Notice that we use Polish notation for terms by placing the function 
symbol at the left. The terms do not contain parentheses or commas. We 
will later prove a unique readability result, showing that given a term, 
we can unambiguously decompose it. 

The terms are the expressions that are translated as names of objects 
(noun phrases), in contrast to the wffs which are translated as assertions 
about objects. 

Some examples of terms in the language of number theory are 

+ 17 2 SO, 

SSSSO, 

+Et7 1 SSOEt7 2 SSO. 

The atomic formulas will play a role roughly analogous to that played 
by the sentence symbols in sentential logic. An atomic formula is an 
expression of the form 

Pt\ • •' t n , 

where P is an n -place predicate symbol and t\..... t n are terms. 

For example, =v\V 2 is an atomic formula, since = is a two-place 
predicate symbol and each variable is a term. In the language of set 
theory we have the atomic formula e Vs . 
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Notice that the atomic formulas are not defined inductively. In¬ 
stead we have simply said explicitly just which expressions are atomic 
formulas. 

The well-formed formulas are those expressions that can be built up 
from the atomic formulas by use (zero or more times) of the connective 
symbols and the quantifier symbol. We can restate this in the terminol¬ 
ogy of Chapter 1 by first defining some formula-building operations on 
expressions: 

£-.(k) = (-■/), 

S^(y,S) = (y^S), 

Qi(y) = v ViY- 

Definition. The set of well-formed formulas (wffs, or just formulas) 
is the set of expressions that can be built up from the atomic 
formulas by applying (zero or more times) the operations 
£_>, and Q,-(f = 1,2 ,...). 

For example, on the one hand, -i v 3 is not a wff. (Why?) On the other 
hand, 

Vt>i((-iVu 3 (-. en 3 ni))—>(-iVt>2(en2ni—> 

(-. \/ v 4 (ev 4 v 2 —► (-i ev 4 vi )))))) 
is a wff, as is demonstrated by the following tree: 
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But it requires some study to discover that this wff is the axiom of 
regularity for set theory. 

Free Variables 

Two examples of wffs are V and (—i V vi(—i V V 2 &V 2 V 1 )). But 

there is an important difference between the two examples. The second 
might be translated back into English as 

There is a set such that every set is a member of it. 

The first example, however, can be translated only as an incomplete 
sentence, such as 


Every set is a member of_,. 

We are unable to complete the sentence without knowing what to do 
with vi. In cases of this sort, we will say that V\ occurs free in the wff 
V V 2 ^vo V\ . In contrast, no variable occurs free in (- 1 V v\{— 1 V V 2 &V 2 V 1 )). 
But of course we need a precise definition which does not refer to 
possible translations to English but refers only to the symbols 
themselves. 

Consider any variable x. We define, for each wff a, what it means 
for x to occur free in a. This we do by recursion: 

1. For atomic a, x occurs free in a iff x occurs in (i.e., is a symbol 
of) a. 

2 . x occurs free in (—1 a) iff x occurs free in a. 

3. x occurs free in (a —>- ft) iff x occurs free in a or in ft. 

4. x occurs free in V v, a iff x occurs free in a and x f Vj. 

This definition makes tacit use of the recursion theorem. We can 
restate the situation in terms of functions. We begin with the function h 
defined on atomic formulas: 

h{a) = the set of all variables, if any, in the atomic formula a. 

And we want to extend h to a function h defined on all wffs in such a 
way that 

h(£-,(at)) = h(a), 

/((£_>(a, ft)) = h(a)Uh(ft), 

h(Qj(a)) = h(a) after removing v,-, if present. 

Then we say that x occurs free in a (or that x is a free variable of a) iff x e 
h(a). The existence of a unique such h (and hence the meaningfulness 
of our definition) follows from the recursion theorem of Section 1.4 
and from the fact (proved in Section 2.3) that each wff has a unique 
decomposition. 
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If no variable occurs free in the wff a (i.e., if h(a) = 0), then a is a 
sentence. (The sentences are intuitively the wffs translatable into English 
without blanks, once we are told how to interpret the parameters.) 

For example, V ib(Aib —* Bih) and V m( P —*V v^Qv^) are sen¬ 
tences, but V\ occurs free in (V V\ Av\ —► B V\ ). The sentences are usually 
the most interesting wffs. The others lead a second-class existence; they 
are used primarily as building blocks for sentences. 

In translating a sentence from English, the choice of particular vari¬ 
ables is unimportant. We earlier translated “All apples are bad” as 
V v\(Av\ —► Bv\). We could equally well have used 

V vri{Av 2 i —*■ Bv 27). 

The variable is, in effect, used as a pronoun, just as in English we might 
say, “For any object whatsoever, if it is an apple, then it is bad.” (We 
have incorporated into our language an adequate supply of pronouns: 
iti, it 2 , ....) Since the choice of particular variables is unimportant, 
we will often not even specify the choice. Instead we will write, for 
example, V x(Ax —> Bx), where it is understood that x is some variable. 
(The unimportance of the choice of variable will eventually become a 
theorem.) 

Similar usages of variables occur elsewhere in mathematics. In 

7 

a 'j 

i=1 

i is a “dummy” variable but j occurs free. 

On Notation 

We can specify a wff (or indeed any expression) by writing a line that 
displays explicitly every symbol. For example, 

V V\ ((-1 = 1 ) 1 0) —* hVlhJ -1 =l) 1 St> 2 ))). 

But this way of writing things, while splendidly complete, may not 
be readily comprehensible. The incomprehensibility can be blamed (in 
part) on the simplifications we wanted in the language (such as the lack 
of an existential quantifier symbol). We naturally want to have our cake 
and eat it too, so we now will agree on methods of specifying wffs in 
more indirect but more readable ways. These conventions will let us 
write a line such as 


V viiv\ ^ 0 —> 3 v 2 v\ — Sib) 

to name the same wff as is named by the other line above. 

Note well that we are not changing our definition of what a wff is. 
We are simply conspiring to fix certain ways of naming wffs. In the 
(rare) cases where the exact sequence of symbols becomes important. 



78 


A Mathematical Introduction to Logic 


we may have to drop these new conventions and revert to primitive 
notation. 

We adopt then the following abbreviations and conventions. Here a 
and p are formulas, .v is a variable, and u and t are terms. 

(a V P) abbreviates ((-u«) —* P). 

(a A P) abbreviates ( -i (a —► (-i P))). 

(a ** P) abbreviates ((a —► P) A (P —> a)); i.e., 

(-. ((of — >8) — (08 — «)))). 

3 x a. abbreviates (—i V x( —i a)). 

u = t abbreviates =ut. A similar abbreviation applies to some other 
two-place predicate and function symbols. For example, 2 < 3 abbre¬ 
viates <2 3 and 2 + 2 abbreviates +2 2. 

abbreviates (—i —ut); similarly u t abbreviates (—i <ut). 

For parentheses we will use not only ( and) but also [ and ], etc. And 
we omit mention of just as many as we possibly can. Toward that end 
we adopt the following conventions: 

1. Outermost parentheses may be dropped. For example, V x a —► p 
is (Vx a —► P). 

2. -i, V, and 3 apply to as little as possible. For example, 

-iaA P is ((-■ a) A P), and not -i(o , A/S); 

Vxa—>P is (Via—*•/!), and not Vi(a—► P)\ 

3iaA P is (3xa A P), and not 3i(aA P). 

In such cases we might even add gratuitous parentheses, as in (3 x a) A/1. 

3. A and V apply to as little as possible, subject to item 2. For 
example, 

-iaAjl->]/ is (( -i a) A P) —* y. 

4. When one connective is used repeatedly, the expression is grouped 
to the right. For example, 

a p y is a —* (P —* y). 

Examples of how we can eliminate abbreviations, rewriting the for¬ 
mula in an unabbreviated way that explicitly lists each symbol in 
order: 

1. 3 x(Ax A Bx ) is (—iVx( —i (—i (Ax —> (~i Bx))))). 

But (-i V x(Ax —>- (—1 Bx))) would be an equivalent formula (in any 

reasonable notion of equivalence). 

2. 3 x Ax —* Bx is ((—i V x(—i Ax)) —* Bx). 

3 x(Ax —* Bx) is (-i V x( -i (Ax —> Bx))). 

We will try to use the various alphabets in a systematic way. The 
system is listed below, but there will be occasional exceptions for special 


reasons. 
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Predicate symbols: Uppercase italic letters. Also e, <. 

Variables: t u, v, x, y, z. 

Function symbols: /, g, h. Also S, +, etc. 

Constant symbols: a, b, .... Also 0. 

Terms: m, t. 

Formulas: Lowercase Greek letters. 

Sentences: a, r. 

Sets of formulas: Uppercase Greek letters, plus certain italic letters 
that pretend to be Greek, viz., A (alpha) and T (tau). 

Structures (see the next section): Uppercase German (Fraktur) letters. 


Exercises 

1. Assume that we have a language with the following parameters: 
V, intended to mean “for all things”; N , intended to mean “is a 
number”; I, intended to mean “is interesting”; <, intended to mean 
“is less than”; and 0 , a constant symbol intended to denote zero. 
Translate into this language the English sentences listed below. If 
the English sentence is ambiguous, you will need more than one 
translation. 

(a) Zero is less than any number. 

(b) If any number is interesting, then zero is interesting. 

(c) No number is less than zero. 

(d) Any uninteresting number with the property that all smaller 
numbers are interesting certainly is interesting. 

(e) There is no number such that all numbers are less than it. 

(f) There is no number such that no number is less than it. 

2. With the same language as in the preceding exercise, translate back 
into good English the wff 

V.r(ALv —> lx —► -i Vy(Ny —► Iy —► —>x < >>))• 

In 3-8, translate each English sentence into the first-order lan¬ 
guage specified. (You may want to carry out the translation in several 
steps, as in some of the examples.) Make full use of the notational 
conventions and abbreviations to make the end result as readable 
as possible. 

3. Neither a nor b is a member of every set. (V, for all sets; e, is a 
member of; a , a; b, b.) 

4. If horses are animals, then heads of horses are heads of animals. 
(V, for all things; E, is a horse; A, is an animal; hx, the head of x.) 

5. (a) You can fool some of the people all of the time, (b) You can 
fool all of the people some of the time, (c) You can’t fool all of 
the people all of the time. (V, for all things; P, is a person; T , is a 
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time; Fxy, you can fool x at y. One or more of the above may be 
ambiguous, in which case you will need more than one translation.) 

6 . (a) Adams can’t do every job right, (b) Adams can’t do any job 
right. (V, for all things; J, is a job; a, Adams; Dxy, x can do y 
right.) 

7. (a) Nobody likes everybody, (b) No Democrat likes every Repub¬ 
lican. (V, for all people; Lxy, x likes y; D, is a Democrat; R, is a 
Republican.) 

8 . (a) Every farmer who owns a donkey needs hay. (b) Every farmer 
who owns a donkey beats it. (V, for all things; F, is a farmer; I), is 
a donkey; Oxy, x owns y; H, needs hay; Bxy, x beats y.) 

9. Give a precise definition of what it means for the variable x to occur 
free as the ;th symbol in the wff a. (If x is the zth symbol of a but 
does not occur free there, then it is said to occur bound there.) 

10. Rewrite each of the following wffs in a way which explicitly lists 
each symbol in the actual order: 

(a) 3v\ Pv\ A Pv i. 

(b) V V\ Av\ A Bv\ — y 3 tb —> CVo V Dv 2. 

In each case, say which variables occur free in the wff. 


SECTION 2.2 _ 

Truth and Models 

In sentential logic we had truth assignments to tell us which sentence 
symbols were to be interpreted as being true and which as false. In first- 
order logic the analogous role is played by structures, which can be 
thought of as providing the dictionary for translations from the formal 
language into English. (Structures are sometimes called interpretations, 
but we prefer to reserve that word for another concept, to be encountered 
in Section 2.7.) 

A structure for a first-order language will tell us 

1. What collection of things the universal quantifier symbol (V) 
refers to, and 

2. What the other parameters (the predicate and function symbols) 
denote. 

Formally, a structure 21 for our given first-order language is a function 
whose domain is the set of parameters and such that 1 


1 The symbol "21'’ is the letter A in the German (Fraktur) alphabet. The next two letters 
are 2} and C. 
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1. 21 assigns to the quantifier symbol V a nonempty set |2l| called 
the universe (or domain ) of 21. 

2. 21 assigns to each n -place predicate symbol P an n-ary relation 
P'~ [ C |2l|”; i.e., P 21 is a set of n-tuples of members of the universe. 

3. 21 assigns to each constant symbol c a member c' 1 of the uni¬ 
verse |2l|. 

4. 21 assigns to each n -place function symbol / an n-ary operation 
/ a on |2l|; i.e., / a : |2t|" —»■ |2t|. 

The idea is that 21 assigns meaning to the parameters. V is to mean 
“for everything in |2l|.” The symbol c is to name the point c 21 . The 
atomic formula Pt\- ■ -t n is to mean that the n -tuple of points named by 
t \,..., t n is in the relation P 21 . (We will shortly restate these conditions 
more carefully.) 

Note that we require the universe |2l| to be nonempty. Notice also that 
/ 2l must have all of |2l|" in for its domain; we have made no provision 
for partially defined functions. 

Example. Consider the language for set theory, whose only param¬ 
eter (other than V) is e. Take the structure 21 with 

1211 = the set of natural numbers, 

e' a = the set of pairs (m, n) such that m < n. 

(Thus we translate e as “is less than.”) In the presence of a structure 
we can translate sentences from the formal language into English 
and attempt to say whether these translations are true or false. The 
sentence of this first-order language 

(or more formally, (—i V Vj(-i V vo( -i e V 2 V\)))), which under an¬ 
other translation asserts the existence of an empty set, is now 
translated under 21 into 

There is a natural number such that no natural number 
is smaller, 

which is true. Because of this we will say that 3 x V y —i v E.r is 
true in 21, or that 21 is a model of the sentence. On the other hand, 
21 is not a model of the pair-set axiom, 

V jc V y 3 z V t (f e z ** t — x V t — y), 

as the translation of this sentence under 21 is false. For there is no 
natural number m such that for every n, 

n < m iff n = 1. 

(The reader familiar with axiomatic set theory can check that 21 
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is a model of the extensionality axiom, the union axiom, and the 
axiom of regularity.) 

Example. Again assume the language has only the parameters V 
and a two-place predicate symbol E. But this time, consider the 
finite structure IB with universe 23 consisting of a set of four 
distinct objects {a, b, c, d\. Suppose the binary relation E® is the 
following set of pairs: 

E® = {(a,/?), (b,a), {b,c ), {c,c>}. 

Then we can picture 23 as the directed graph whose vertex-set is 
the universe {a, b, c, d}\ 


a • 




• d 


Here we interpret Exy as saying there is an edge from vertex x to 
vertex y in the graph. (If the binary relation E® had been symmetric, 
then we could have pictured the structure as an undirected graph.) 

Consider now the sentence 3x Vy —i y Ex. Under the structure 
23, we can translate this as follows: 

There is a vertex such that for any vertex, 
no edge points from the latter to the former. 

(The English version is harder to read than the symbolic one!) This 
sentence is true in 23, because no edge points to the vertex d. 

In the preceding examples it was intuitively pretty clear that certain 
sentences of the formal language were true in the structure and some 
were false. But we want a precise mathematical definition of “a is true 
in 21.” This should be stated in mathematical terms, without employing 
translations into English or a supposed criterion for asserting that some 
English sentences are true while the others are false. (If you think you 
have such a criterion, try it on the sentence “This sentence is false.”) In 
other words, we want to take our informal concept of “er is true in 21” 
and make it part of mathematics. 

In order to define “cr is true in 21,” 

l=a 

for sentences a and structures 21, we will find it desirable first to define 
a more general concept involving wffs. Let 

q be a wff of our language, 

21 a structure for the language. 
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s : V -* 1211 a function from the set V of all variables into the universe 
|2t| of 21. 

Then we will define what it means for 21 to satisfy cp with s, 

l=a <Phl- 

The informal version is 

ha if an d only if the translation of q> determined by 21, where 
the variable x is translated as .v (x) wherever it occurs free, is true. 

The formal definition of satisfaction proceeds as follows: 

I. Terms. We define the extension 

s : T -» |2l|, 

a function from the set T of all terms into the universe of 21. The idea 
is that s(t) should be the member of the universe |2t| that is named by 
the term t.s is defined by recursion as follows: 

1. For each variable x, s(x) — s(x). 

2. For each constant symbol c, 1(c) = c a . 

3. If p...., t„ are terms and / is an / 2 -place function symbol, then 

s(fti ■ ■ ■ t n ) = f*(s(h),s(t„)). 


A commutative diagram, for n— 1, is 

T— 


The existence of a unique such extension s of ^ follows from the recur¬ 
sion theorem (Section 1.4), by using the fact that the terms have unique 
decompositions (Section 2.3). Notice that s depends both on ,v and on 21. 
(In fact a reasonable alternative notation for s(t) would be r l (.v|, which 
explicitly displays the dependence on 21.) 

II. Atomic formulas. The atomic formulas were defined explicitly, 
not inductively. The definition of satisfaction of atomic formulas is 
therefore also explicit, and not recursive. 

1 . (= a =f 1 t 2 [s] iff Kfi) = s(t 2 ). 

(Thus = means =. Note that = is a logical symbol, not a parameter 
open to interpretation.) 

2. For an n -place predicate parameter P , 

1=21 Pt\ - t n [.S'] iff (s(ti),s(t „)) e P a . 
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III. Other wffs. The wffs we defined inductively, and consequently 
here satisfaction is defined recursively. 


1. For atomic formulas, the definition is above, 

2 . 

3. |= 2 i (<p —> \jr)[s] iff either ^[s] or |= 2 i lA [j] or both. 

(In other words, if 21 satisfies (p with s then 21 satisfies \[r with s.) 

4. |=a V.r (p[s] iff for every d e |2t|, we have \=% <p[s(x | r/)]. 


Here s{x d) is the function which is exactly like 5 except for one thing: 
At the variable x it assumes the value d. This can be expressed by the 
equation 


s(x | d)iy) = 


s(y) if y^x, 

d if y — x. 


(Thus V means “for all things in |2t|.”) 

At this point the reader might want to reconsider the informal version 
of |= 2 (. <p[s] on page 83 and observe how it has been formalized. 

We should remark that the definition of satisfaction is another ap¬ 
plication of the recursion theorem together with the fact that the wffs 
have unique decompositions. The definition can be restated in terms of 
functions to make it clearer how the recursion theorem of Section 1.4 
applies: 


(i) Consider one fixed 21. 

(ii) Define a function h (extending a function h defined on atomic 
formulas) such that for any wff <p , h((p ) is a set of functions from V 
into |2l|. 

(iii) Define 

|= 2 i tp[s] iff s e Ti{q>). 


We leave to the reader the exercise of writing down the explicit 
definition of h and the clauses that uniquely determine its extension 
h. (See Exercise 7.) An elegant alternative is to have h((p ) be a set of 
functions on the set of those variables that occur free in (p. 

Example. Assume that our language has the parameters V, P (a two- 
place predicate symbol), / (a one-place function symbol), and c (a 
constant symbol). Let 21 be the structure for his language defined 
as follows: 

|2l| = N, the set of all natural numbers, 

P' 1 - — the set of pairs (m, n) such that m < n, 

/ a = the successor function S\ f % {n ) — n + 1, 
c a = 0. 


We can summarize this in one line, by suppressing the fact that 21 



Chapter 2: First-Order Logic 


85 


is really a function and merely listing its components: 

51= (N; <, 5,0). 

This notation is unambiguous only when the context makes clear 
just which components go with which parameters. 

Let s : V —> N be the function for which .v ( v,) = i — 1; i.e., 
= 0 , s(v 2 ) = 1 , and so forth. 

1. s(ffv 3 ) = 5(5(2)) = 4 and J(.M) = 5(0) = 1. 

2. 5(c) = 0 and s(ffc) = 2; no use is made of s. 

3. |= 2 i Pcfv\[s]. This is informally obvious, since when we 
translate back into English we get the true sentence “0 < 1.” More 
formally, the reason is that 

(m,s(fvi)) = (0, 1) e P a . 

4. [=21 V V\ Pcv\. The translation into English is “0 is less 
than or equal to any natural number.” Formally we must verify 
that for any n in N, 

1=21 Pcv\[s(vx |«)], 

which reduces to 

(0, n) e P a , 

i.e., 0 < n. 

5. V V\ [^] because there is a natural number m such 

that 


Pv 2 v l [s(vi | m)]\ 
i.e., 

{s(v 2 ),m) i P a . 

In fact, since .v ( v 2 ) = I, we must take m to be 0. 

The reader is to be cautioned against confusing, for example, 
the function symbol f with the function / a . 

Example. Previously we considered the structure 03 with 

|Q3| = {a, b, c, d] and E® = {{a, b), (b, a), {b, c), {c, c)} 

for the language with parameters V and E: 

c 

>- • »d 

o 



a • 
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Then 1=® V ib -' Ev 2 Vi[s] iff s(«i) — d. That is, there is no 
edge pointing to the vertex d, but d is the only such vertex. Tak¬ 
ing the negation of the formula, we have |=qj 3 V 2 Ev 2 V\[s'\ iff 
•s(iti) e {a, b, c}. 

At this point we pause to verify that when we want to know whether 
or not a structure 51 satisfies a wff <p with s, we do not really need all 
of the (infinite amount of) information s gives us. All that matter are 
the values of the function s at the (finitely many) variables which occur 
free in <p. In particular, if <p is a sentence, then s does not matter at all. 

Theorem 22A Assume that .V] and S 2 are functions from V into 21 
which agree at all variables (if any) that occur free in the wff <p. 
Then 


l=2i Pl>i] iff hoi ^[521- 

Proof. Because satisfaction was defined recursively, this proof 
uses induction. We consider the fixed structure 21 and show by 
induction that every wff cp has the property that whenever two 
functions ,s’i, .S '2 agree on the variables free in <p then 21 satisfies <p 
with Ji iff it does so with S 2 - 

Case 1: <p — Pt\ • ■ ■ t„ is atomic. Then any variable in <p occurs 
free. Thus ,V| and s 2 agree at all the variables in each t,. It follows 
that 51 (tj ) —J 2 (tj) for each i ; a detailed proof would use induction 
on tj. Consequently, 21 satisfies Pt\ ■ ■ ■ t n with ,V| iff it does so 

With 52- 

Cases 2 and 3: <p has the form -iaora —> fi. These cases are 
immediate from the inductive hypothesis. 

Case 4: cp — x ip-. Then the variables free in <p are those free 
in t// with the exception of x. Thus for any d in |2t|, 5i(.r | d) and 
S 2 (x | d) agree at all variables free in x[r. By inductive hypothesis, 
then, 21 satisfies i jr with 5i (x \ d) iff it does so with S 2 {x \d). From 
this and the definition of satisfaction we see that 21 satisfies V .r i jr 
with j! iff it does so with S 2 - 3 

In effect, the above proof amounts to looking through the definition 
of satisfaction and seeing what information given by s was actually 
used. There is an analogous fact regarding structures: If 21 and 23 agree 
at all the parameters that occur in <p, then \=<& <p[s] iff |=fg (p[s\. 

This theorem justifies the following notation: Suppose that <p is a 
formula such that all variables occurring free in (p are included among 
V\,, Vk. Then for elements ai,..., a* of |2l|, 


1=21 <PD>1, • • • - fl/tll 
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means that 21 satisfies tp with some (and hence with any) function 
s : V-> |2t| for which s(Vj) = a,, 1 < i < k. To return to a recent 
example where 21 = (N; <, S, 0), we have |= 2 i V v 2 Pv\ v 2 lOJ but 
V v 2 Pn 1 n 2 E5]|. 

Corollary 22B For a sentence a, either 

(a) 21 satisfies a with every function s from V into |2l|, or 

(b) 21 does not satisfy a with any such function. 

If alternative (a) holds, then we say that a is true in 21 (written |= 2 i a) 
or that 21 is a model of o. And if alternative (b) holds, then of course 
a is false in 21. (They cannot both hold since |2l| is nonempty). 21 is a 
model of a set £ of sentences iff it is a model of every member of £. 

Example. If is the real field, (JR; 0, 1, +, x), and £3 is the rational 
field, (Q; 0 , 1 , +, x), is there a sentence true in one and false in 
the other? Yes; because V2 is irrational, the sentence 3 x(x • x — 
1 + 1 ) is false in the rational held, but true in the real held. 

Example. Suppose our given language has only the parameters V 
and P , where P is a two-place predicate symbol. Then a structure 
21 is determined by the universe |2l| and the binary relation P' 1 . 
With some minor illegality we again write 

21 = (| 2 l|; P’ a ). 

Now consider the problem of characterizing the class of all models 
of the following sentences: 

1. VxVyx = y. A structure (A; R) is a model of this iff A 
contains exactly one element. R can either be empty or can be the 
singleton A x A. 

2. Vi V y Pxy. A structure (A; R) is a model of this iff R = 
A x A. A can be any nonempty set. 

3. ViVy-i Pxy. A structure (A; R) is a model of this iff 
R = 0. 

4. V x 3 y Pxy. The condition for (A; R) to be a model of this 
is that the domain of R is A. 

The notational conventions adopted earlier were done in a ra¬ 
tional way: 

1. |= 2 ! (a A /S)[y] iff |= 2 i q;[y] and ^21 /V[v]; similarly for V 
and 

2. |= 2 i 3 x q;[y] iff there is some d e |2l| with the property that 
1=21 U[S{X | rf)]. 
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The proof for the second of these is as follows: 

|= 2 i 3.v a [x] iff |=gi —i V.r —ia[s], 
iff Vx-.a[s], 
iff it is not the case that for all d in |2l|, 

|= 2 i -'a[s(.r | d)], 

iff it is not the case that for all d in |2l|, 

^ 2 i u[s(x | d)], 

iff for some d in |2l|, [=21 a[s(x | r/)]. 

Logical Implication 

We now have the necessary tools to formulate the important concept of 
logical implication for our language. 

Definition. Let T be a set of wffs, <p a wff. Then T logically implies 
<p, written T |= cp, iff for every structure 21 for the language and 
every function s : V -> |2t| such that 21 satisfies every member 
of T with s, 21 also satisfies <p with s. 

We use the same symbol, “|=,” that was used in Chapter 1 for tau¬ 
tological implication. But henceforth it will be used solely for logical 
implication. As before we will write “y |= in place of “{/} |= <p.” 
Say that <p and i fr are logically equivalent (yp |==| i//) iff <p |= i/r and 
t N < P - 

The hrst-order analogue of the concept of a tautology is the concept 
of a valid formula: A wff <p is valid iff 0 |= <p (written simply “|= <p”). 
Thus <p is valid iff for every 21 and every s : V -» |2t|, 21 satisfies <p 
with s. 

For sentences, the concept of logical implication can be stated more 
concisely, by applying Theorem 22A: 

Corollary 22C For a set E ; r of sentences, E (= r iff every model 
of E is also a model of r. A sentence r is valid iff it is true in 
every structure. 

Examples of logical implication. Readers are invited to convince 
themselves of each of the following: 


1. V vi Qv i |= Qv 2 . 

2. Qv\ V V\ Qv\. Here it suffices to find just one structure 
21 and just one function s : V |2l| such that on the one hand, 
l= 2 t Q v i [s] but, on the other hand, 21 is not a model of V W] Qv\ . 
|2t| will need to have at least two members. 

3. |= —i —i er —> a. If 21 is a model of —i —i a, then ^=21 ~ 11 o' 
whence |= 2 i o. But one might raise the objection: Aren’t we 



Chapter 2: First-Order Logic 


89 


using here the law of double negation, the law we are claiming 
to prove? The answer is a definite yes and no. We are proving 
the law of double negation for the formal language we are talking 
about (sometimes called the object language). In doing so, we can 
of course use any correct reasoning (out in the meta-language, 
English), exactly as we might do in reasoning about vector spaces 
or graphs. In particular, the reasoning might involve principles 
that when formally modeled would involve —i —i cr and a. There 
is no circularity. But the meta-language sentences we use are — 
unsurprisingly — related to the object language formulas we talk 
about. In this connection, see the book’s only picture (at the end 
of Section 2.4). 

4. V V\ Qv\ |= 3 «2 Q W• Recall that the universe of any 
structure is nonempty. 

5. 3x Vy Pxy \= Vy 3x Pxy. This example will come up 
again in Section 2.4. 

6. Vy 3 x Pxy 3xVy Pxy. 

7. |= 3x(Qx —V.i Qx). This is a strange — but valid — 
sentence. 

The definition of logical implication is very much like the definition 
of tautological implication in Chapter 1. But there is an important dif¬ 
ference of complexity. Suppose in sentential logic you want to know 
whether or not a wff a is a tautology. The definition requires that you 
consider finitely many truth assignments, each of which is a finite func¬ 
tion. For each such truth assignment v, you must calculate Ufa), which 
can be effectively done in a finite length of time. (Consequently, the set 
of tautologies is decidable, as was observed before.) 

In contrast to the Unitary procedure for tautologies, suppose that you 
want to know whether or not a wff q> (of our first-order language) is 
valid. The definition requires that you consider every structure 21. (In 
particular this requires using every nonempty set, of which there are a 
great many.) For each of these structures, you then must consider each 
function s from the set V of variables into |2l|. And for each given 21 
and s , you must determine whether or not 21 satisfies <p with s. When 
|2t| is infinite, this is a complicated notion in itself. 

In view of these complications, it is not surprising that the set of 
valid formulas fails to be decidable (cf. Section 3.5). What is surpris¬ 
ing is that the concept of validity turns out to be equivalent to another 
concept (deducibility) whose definition is much closer to being Uni¬ 
tary. (See Section 2.4.) Using that equivalence we will be able to show 
(under some reasonable assumptions) that the set of validities (i.e., the 
set of valid wffs) is effectively enumerable. The effective enumeration 
procedure yields a more concrete characterization of the set of valid 
formulas. 
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Definability in a Structure 

Suppose we want to study the real field, (R; 0. 1, +, ■), consisting of the 
set R of real numbers, together with the distinguished elements 0 and 1 
and the two operations of addition and multiplication. We can consider 
the real field as a structure 

m= (R; 0, 1,+, •) 

where the language (with equality) has constant symbols 0 and 1 and 
two-place function symbols + and •. 

Although we have not included an ordering symbol < in the lan¬ 
guage, we still have a way to say “x > 0.” Because in this structure, the 
nonnegative elements are exactly the elements with square roots. That 
is, the formula 3 v 2 x = v 2 ■ v 2 is satisfied in the structure IK whenever 
x is assigned a nonnegative number, and only then: 

|= 9 i 3 v 2 Vi — v 2 ■ v 2 Ifl]] <^=A a > 0. 

Because of this fact, we will say that the interval [0, oo) is definable in 
IK, and that the formula 3 v 2 V\ — v 2 • v 2 defines it. 

Moreover, the ordering relation on the reals, i.e., the binary relation 

{{a, b) e R x R | a < b}, 

is defined in the structure IK by the formula expressing “v\ < v 2 ”\ 

3l>3 V 2 — V\ + v 2 • v 2 . 

For a smaller example, take the directed graph 
21= ({ a,b,c}\ {( a,b >, (a,c)}) 
where the language has parameters V and E: 

a 

b c 

Then in 21, the set {b, c) (the range of the relation E a ) is defined by the 
formula 3 v 2 Ev 2 V\. In contrast, the set {/;) is not definable in 21. This 
is because there is no definable property in this structure that would 
separate b and c; the proof of this fact will utilize the homomorphism 
theorem, to be proved later in this section. 

We now want to set forth precisely this concept of definability of 
a subset of the universe or of a relation on the universe. Consider a 
structure 21 and a formula tp whose free variables are among V\,..., v k . 
Then we can construct the k-ary relation on |2l| 

{(<?!, ...,a k ) | [= 2 i tpla x , ...,a k J}. 

Call this the A:-ary relation tp defines in 21. In general, a A'-ary relation 
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on 1211 is said to be definable in 21 iff there is a formula ( whose free 
variables are among V\, , %) that defines it there. 

Example. Assume that we have a part of the language for number 
theory, specifically that our language has the parameters V, 0, S, 
+, and •. Let 91 be the intended structure: 

|91| = N, the set of natural numbers. 

O' 31 = 0, the number 0. 

S' 31 , + ' ! , and are S, +, and •, the functions of successor, 

addition, and multiplication. 


In one equation, 

91= (N; 0, S, +, •)• 

Some relations on N are definable in 91 and some are not. One 
way to show that some are not definable is to use the fact that 
there are uncountably many relations on N but only countably 
many possible defining formulas. (There is, however, an inherent 
difficulty in giving a specific example. After all, if something is 
undefinable, then it is hard to say exactly what it is! Later we 
will get to see a specific example, the set of Godel numbers of 
sentences true in 91; see Section 3.5.) 

1. The ordering relation {{m, n) \ m < n) is defined in 91 by 
the formula 


3 Vj V\ + Sl >3 = v 2 . 

2. For any natural number n , {«} is definable. For example, {2} 
is defined by the equation 

til = sso. 

Because of this we say that n is a definable element in 91. 

3. The set of primes is definable in 91. We could use the formula 

1 < Vi A V v 2 V V 2 (Vx — v 2 • v 2 —► v 2 — 1 V 1>3 = 1) 

if we had parameters 1 and < for 1 and <. But since {1} and < 
are definable in 91, it is really quite unnecessary to add parameters 
for them; we can simply use their definitions instead. Thus the set 
of primes is definable by 

3 t >3 SO + St >3 — v\ A V v 2 V i >3 ( v\ — v 2 • v 2 —*- 

v 2 — SO V v 2 — SO). 

4. Exponentiation, {{m, n. p)\ p — m"} is also definable in 
91. This is by no means obvious; we will give a proof later (in 
Section 3.8) using the Chinese remainder theorem. 
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In fact, we will argue later that any decidable relation on N is definable 
in 91, as is any effectively enumerable relation and a great many others. 
To some extent the complexity of a definable relation can be measured 
by the complexity of the simplest defining formula. This idea will come 
up again at the end of Section 3.5. 

Definability of a Class of Structures 

Many a mathematics class, on its first day, begins with the instructor 
saying something like one of the following: 

1. “A graph is defined to consist of a nonempty set V together with 
a set E such that. ...” 

2. “A group is defined to consist of a nonempty set G together with 
a binary operation o satisfying the axioms. ...” 

3. “An ordered field is defined to consist of a nonempty set F together 
with two binary operations + and • and a binary relation < satisfying 
the axioms. ...” 

4. “A vector space is defined to consist of a nonempty set V together 
with a binary operation + and, for each real number r, an operation 
called scalar multiplication such that. ...” 

We want to abstract from this situation. In each case, the objects of 
study (the graphs, the groups, and so forth) are structures for a suitable 
language. Moreover, they are required to satisfy a certain set E of sen¬ 
tences (referred to as “axioms”). The course in question then studies the 
models of the set E of axioms — or at least some of the models. 

For a set E of sentences, let Mod E be the class of all models of E, 
i.e., the class of all structures for the language in which every member 
of E is true. For a single sentence r we write simply “Mod r” instead 
of “Mod {r}.” (The reader familiar with axiomatic set theory will notice 
that Mod E, if nonempty, is a proper class; i.e., it is too large to be a set.) 

A class 1C of structures for our language is an elementary class (EC) 
iff ]C — Mod r for some sentence r. 1C is an elementary class in the 
wider sense (ECa) iff JC = Mod E for some set E of sentences. (The 
adjective “elementary” is employed as a synonym for “first-order.”) 

Examples 

1. Assume that the language has equality and the two pa¬ 
rameters V and E, where £ is a two-place predicate symbol. 
Then a graph is a structure for this language 21 = (V; E % ) 
consisting of a nonempty set V of objects called vertices (or 
nodes), and an edge relation E 21 that is symmetric (if uE^v then 
vE'fi) and irreflexive (never vE' l v). The axiom stating that the 
edge relation is symmetric and irreflexive can be translated by 
the sentence 
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Vx(~i xEx A V y{xEy —> yEx)). 

So the class of all graphs is an elementary class. For directed 
graphs or digraphs, the assumption of symmetry is dropped. And 
if one wants to allow “loops” then the assumption of irreflexivity 
is dropped. But perhaps the instructor then explains that the course 
will study only finite graphs. Is the class of all finite graphs an 
elementary class? No, we will prove later that it is not, not even 
in the wider sense. 

2. Assume that the language has equality and the parameters 
V and P, where P is a two-place predicate symbol. As before, a 
structure (A; R) for the language consists of a nonempty set A 
together with a binary relation R on A. (A; R) is called an ordered 
set iff R is transitive and satisfies the trichotomy condition (which 
states that for any a and b in A, exactly one of (a, b) e R, a = b, 
(b,a) e R holds). Because these conditions can be translated into 
a sentence of the formal language, the class of nonempty ordered 
sets is an elementary class. It is, in fact. Mod r, where r is the 
conjunction of the three sentences 

V x V y V z(x Py —> yPz —> xPz)\ 

V jc V y(xPy V x — y V yPx ); 

V jc V y(xPy — * yPx). 

The next two examples assume that the reader has had some 
contact with algebra. 

3. Assume that the language has = and the parameters V and 
o, where o is a two-place function symbol. The class of all groups 
is an elementary class, being the class of all models of the con¬ 
junction of the group axioms: 

V x V y V z (x o y) o z = x o (y o z); 
xfi y3zx o z = y\ 

Vx Vy 3zz ox = y. 

The class of all infinite groups is ECa- To see this, let 

A 2 — 3 x 3 y x y, 

A 3 = 3 x 3 y 3 z(x ^yAx^zAy^z), 


Thus k„ translates, “There are at least n things.” Then the group 
axioms together with {A. 2 , A. 3 , ...} form a set E for which Mod E 
is exactly the class of infinite groups. We will eventually (in Sec¬ 
tion 2.6) be able to show that the class of infinite groups is not EC. 

4. Assume that the language has equality and the parameters 
V, 0,1, +, •• Fields can be regarded as structures for this language. 
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The class of all fields is an elementary class. The class of fields of 
characteristic zero is EC a- It is not EC, a fact which will follow 
from the compactness theorem for first-order logic (Section 2.6 
again). 

Homomorphisms 1 

In courses about graphs or groups or vector spaces, one usually encoun¬ 
ters the concept of what it means for two of the structures in question, 21 
and 25, to be isomorphic : Roughly speaking, there must be a one-to-one 
correspondence between their universes |2l| and 23 that “preserves” 
the operations and relations. 

It is then explained that two isomorphic structures, although not 
identical, must have all the same mathematical properties. We want to 
define here the isomorphism concept in a general setting, and to show 
that two isomorphic structures must satisfy exactly the same sentences. 

Let 21, IB be structures for the language. A homomorphism h of 21 
into 23 is a function h : |2l| -» 1$! with the properties: 

(a) For each n -place predicate parameter P and each;; -tuple {a\, ... , 
a„) of elements of |2l|, 

(«i,..., a„) e P' [ iff (fi(fli), • • ■, h(a„)) e P®. 

(b) For each n -place function symbol / and each such n -tuple, 

..., a n )) = ..., h(a n )). 

In the case of a constant symbol c this becomes 

h(c a ) = c®. 

Conditions (a) and (b) are usually stated: “h preserves the relations 
and functions.” (It must be admitted that some authors use a weakened 
version of condition (a); our homomorphisms are their “strong homo¬ 
morphisms.”) 

If, in addition, h is one-to-one, it is then called an isomorphism (or 
an isomorphic embedding ) of 21 into 23. If there is an isomorphism of 
21 onto 18 (i.e., an isomorphism h for which ran h — |23|), then 21 and 
23 are said to be isomorphic (written 21 = 23). 

The reader has quite possibly encountered this concept before in 
special cases such as structures that are groups or fields. 


1 This topic can be postponed somewhat. But homomorphisms will be used in the proof 
of the completeness theorem (with equality). And we make use of the isomorphism 
concept, starting in Section 6 of Chapter 2. 



Chapter 2: First-Order Logic 


95 


Example. Assume that we have a language with the parameters V, 
+, and •. Let 21 be the structure (N; + , ■). We can define a function 
/z : N —> {e, o} by 

, . . f e if n is even, 

Hn) = \o if n is odd. 

Then h is a homomorphism of 21 onto 23, where |23| = {e, o] and 
+' B , are given by the following tables: 


+ <B 

e 

o 


e 

o 

e 

e 

o 

e 

e 

e 

o 

o 

e 

o 

e 

o 


It can then be verified that condition (b) of the definition is satis¬ 
fied. For example, if a and b are both odd numbers, then h ( a+b ) = 
e and h(a) + B h{b) — o + B o — e. 

Example. Let P be the set of positive integers, let < p be the usual 
ordering relation on P, and let < , v be the usual ordering relation 
on N. Then there is an isomorphism h from the structure (P; </>) 
onto (N; <n)', we take h(n) = n — 1. Also the identity map 
Id : P —> N is an isomorphism of (P; </>)into(N; <;v).Because 
of this last fact, we say that (P; <p) is a substructure of (N; <n)- 

More generally consider two structures 21 and 23 for the language 
such that |2t| c 231. It is clear from the definition of homomorphism 
that the identity map from |2l| into |23| is an isomorphism of 21 into 
25 iff 

(a) P % is the restriction of P B to |2l|, for each predicate param¬ 
eter P\ 

(b) f' 2[ is the restriction of /® to |2l|, for each function symbol /, 
and r' ( = c® for each constant symbol c. 

If these conditions are met, then 21 is said to be a substructure of IB, 
and 23 is an extension of 21. 

For example, in a language with a two-place function symbol +, the 
structure (Q; +g) is a substructure of (C; +c)- Here +c is the addition 
operation on complex numbers. And +g, addition on the rationals, is 
exactly the restriction of +c to the set Q. 

In this example, the set Q is closed under +c; that is, the sum of two 
rational numbers is rational. More generally, whenever 21 is a substruc¬ 
ture of 25, then |2l| must be closed under /® for every function symbol 
/. After all, / B (fl) (where a e |2l|") is nothing but / a (a), which must 
be some element in 12l|. This closure property even holds for the 0-place 
function symbols; c® must belong to |2l| for each constant symbol c. 
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Conversely, suppose we have a structure 18, and let A be any non¬ 
empty subset of 1251 that is closed under all of IB’s functions, as in 
the preceding paragraph. Then we can make a substructure of '8 with 
universe A. In fact there is only one way to do this. The universe is A, 
each predicate parameter P is assigned the restriction of P'~' to A, and 
similarly for the function symbols. As an extreme case, if the language 
has no function symbols at all (not even constant symbols), then we can 
make a substructure out of any nonempty subset A of | *81. 

These are basically algebraic concepts, but the following theorem 
relates them to the logical concepts of truth and satisfaction. 

Homomorphism Theorem Let h be a homomorphism of 21 into 23, 
and let s map the set of variables into |2t|. 

(a) For any term t, we have h(s(t)) — ho sit), where ~s{t) is 
computed in 21 and h o sit ) is computed in 25. 

(b) For any quantifier-free formula a not containing the equal¬ 
ity symbol, 

|= 3 l a[s] iff |=35 a[hos]. 

(c) If h is one-to-one (i.e., is an isomorphism of 21 into *8), 
then in part (b) we may delete the restriction “not containing the 
equality symbol.” 

(d) If h is a homomorphism of 21 onto 23, then in (b) we may 
delete the restriction “quantifier-free.” 

Proof. Part (a) uses induction on t: see Exercise 13. Note that h os 
maps the set of variables into |23|; its extension to the set of all 
terms is h o s. It is h o s that is here being evaluated at t. 

(b) For an atomic formula such as Pt, we have 

hat Pt[s] O HO e P 21 

O h(s(t)) e P® since h is a homomorphism 
<S> h o sit) e P® by (a) 

|=a Pt[h o s]. 

An inductive argument is then required to handle the connective 
symbols -i and —>- , but it is completely routine, 

(c) In any case, 

|= 2 t u = f[s] siu) = sit) 

=>■ hisiu)) = hisit)) 

h o s(u) = h o sit) by (a) 

<£> |=93 u — f U l ° s ]- 

If h is one-to-one, the arrow in the second step can be reversed. 

(d) We must extend the routine inductive argument of part (b) 
to include the quantifier step. That is, we must show that if <p has 
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the property that for every s, 

Na (p[s] h® <p[h ° i], 

then V x cp enjoys the same property. We have in any case (as a 
consequence of the inductive hypothesis on (p) the implication 

|=23 V x <p[h os]=> [=21 V.r (pis']. 

This is intuitively very plausible; if (p is true of everything in the 
larger set |Q3|, then a fortiori it is true of everything in the smaller 
set ran h. The details are, for an element a of |2l|, 

1=23 Vx (p[h o s]=^ |=23 (p[(h o s)(x | h(a))] 

|=<s (p[h o (s(x | a))], the functions 
being the same 

<£> |=2| tp[s{x | a)] by the inductive 

hypothesis. 

Now for the converse, suppose that ^23 Vxi p[h o s], so that 
N*b ~ 11 (pl(h o s)(.r | b )] for some element b in |*B|. We need the 
implication 

(*)// for some b in |®|, [=23 ~ 1 (p[(h o s)(x \ b)], then for 
some a in |2l|, 1=23 <Pl(h o s)(s \ h(a ))]. 


For given (*), we can proceed: 


[=23 -1 <p[(h o s)(x I h(a))] [=23 -1 (p[h o (s(x I a))], 

O Na ^(p[s{x | a)] 

=> Vx(p[s]. 


the functions 
being the same 
by the inductive 
hypothesis 


If h maps |2l| onto |*8|, then (*) is immediate; we take a such that 
b — h{a). (But there might be other fortunate times when (*) can 
be asserted even if h fails to have range 1031.) H 


Two structures 21 and 18 for the language are said to be elementarily 
equivalent (written 21 = 18) iff for any sentence cr, 

1=21 a O 1=23 cr- 

Corollary 22D Isomorphic structures are elementarily equivalent: 
2 t = *8 => 21 = *8 


Actually more is true. Isomorphic structures are alike in every “struc¬ 
tural” way; not only do they satisfy the same first-order sentences, they 
also satisfy the same second-order (and higher) sentences (i.e., they are 
secondarily equivalent and more). 

There are elementarily equivalent structures that are not isomorphic. 
For example, it can be shown that the structure (R; <«) consisting of 
the set of real numbers with its usual ordering relation is elementarily 
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equivalent to the structure (Q; <q ) consisting of the set of rational 
numbers with its ordering (see Section 2.6). But Q is a countable set 
whereas R is not, so these structures cannot be isomorphic. In Section 2.6 
we will see how easy it is to make elementarily equivalent structures of 
differing cardinalities. 

Example, revisited. We had an isomorphism h from (P; < P ) onto 
(N; <n)- So in particular, (P; < P ) = (N; <#); these structures 
are indistinguishable by first-order sentences. 

We furthermore noted that the identity map was an isomor¬ 
phic embedding of (P; <p) into (N; <n)- Hence for a function s: 
V -> P and a quantifier-free <p, 

N(P;</>) |=(N;<at) < P[‘ S ]- 

This equivalence may fail if <p contains quantifiers. For example, 
N(P;< P ) V V 2 {Vi / V 2 -* Vi < W2)|[l]|, 
but 


^(N;< w ) V V 2 (Vi v 2 —>• Vi < V>)[1]]. 

An automorphism of the structure 21 is an isomorphism of 21 onto 
21. The identity function on |2l| is trivially an automorphism of 21. 21 
may or may not have nontrivial automorphisms. (We say that 21 is rigid 
if the identity function is its only automorphism.) As a consequence of 
the homomorphism theorem, we can show that an automorphism must 
preserve the definable relations: 

Corollary 22E Let h be an automorphism of the structure 21, and 
let R be an n-ary relation on |2l| definable in 21. Then for any 
a\, ..., a n in |2l|, 

(fli,..., a„) e R <3> (h(a i),..., h(a n )) e R. 

Proof. Let ip be a formula that defines R in 21. We need to know 
that 

1=21 (plcii ,..., 1=21 cplh(ai),h(a n )J. 

But this is immediate from the homomorphism theorem. H 

This corollary is sometimes useful in showing that a given relation 
is not definable. Consider, for example, the structure (M; <) consisting 
of the set of real numbers with its usual ordering. An automorphism 
of this structure is simply a function h from M onto M that is strictly 
increasing: 


a < b O h(a) < h{b). 
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One such automorphism is the function h for which h(ci) — a 3 . Since 
this function maps points outside of N into N, the set N is not definable 
in this structure. 

Another example is provided by elementary algebra books, which 
sometimes explain that the length of a vector in the plane cannot be 
defined in terms of vector addition and scalar multiplication. For the 
map that takes a vector x into the vector 2x is an automorphism of the 
plane with respect to vector addition and scalar multiplication, but it is 
not length-preserving. From our viewpoint, the structure in question, 

(E\ + , f r )re R> 

has for its universe the plane E, has the binary operation + of vector 
addition, and has (for each r in the set R) the unary operation f r of scalar 
multiplication by r. (Thus the language in question has a one-place 
function symbol for each real number.) The doubling map described 
above is an automorphism of this structure. But it does not preserve the 
set of unit vectors, 


{x | x e E and x has length 1}. 

So this set cannot be definable in the structure. (Incidentally, the homo- 
morphisms of vector spaces are called linear transformations.) 


Exercises 

1. Show that (a) F; a \= tp iff T |= (a —* <p); and (b) q> |==] if iff 

t= W ft)- 

2. Show that no one of the following sentences is logically implied 
by the other two. (This is done by giving a structure in which the 
sentence in question is false, while the other two are true.) 

(a) V.rV y V z(Pxy —> Pyz —»• Pxz ). Recall that by our conven¬ 
tion a —*■/!-> y is a —► (f —► y). 

(b) Vx V y(Pxy —* Pyx —► x — y). 

(c) V.r 3 y Pxy —»• 3 y V x Pxy. 

3. Show that 

{V.r(a —* f), Vx a } |= Vr f>. 

4. Show that if x does not occur free in a, then a \= Vx a. 

5. Show that the formula x = y —» Pzfx —> Pzfy (where / is a 
one-place function symbol and P is a two-place predicate symbol) 
is valid. 

6. Show that a formula 6 is valid iff V x 0 is valid. 

7. Restate the definition of “21 satisfies q> with s” in the way described 
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on page 84. That is, define by recursion a function h such that 21 
satisfies (p with .s iff s e h(<p). 

8. Assume that E is a set of sentences such that for any sentence r, 
either E |= r or E ^ -it. Assume that 21 is a model of E. Show 
that for any sentence r, we have |= 2 i r iff E |= r. 

9. Assume that the language has equality and a two-place predicate 
symbol P. For each of the following conditions, find a sentence a 
such that the structure 21 is a model of a iff the condition is met. 

(a) 1211 has exactly two members. 

(b) P'~ [ is a function from |2l| into |2t|. (A function is a single-valued 
relation, as in Chapter 0. For / to be a function from A into B, 
the domain of f must be all of A; the range of / is a subset, 
not necessarily proper, of B.) 

(c) P :i is a permutation of |2t|; i.e., P 21 is a one-to-one function 
with domain and range equal to |2l|. 

10. Show that 

l=2i Vt> 2 Qviv 2 lc^J iff |=2i Vv 2 Qcv 2 . 

Here Q is a two-place predicate symbol and c is a constant symbol. 

11. For each of the following relations, give a formula which defines 
it in (N; +, ■)• (The language is assumed to have equality and the 
parameters V, +, and •)• 

(a) {0}. 

(b) {1}. 

(c) {(;«, n) | n is the successor of m in N}. 

(d) {(;«, n) | m < n in N}. 

Digression: This is merely the tip of the iceberg. A relation on 
N is said to be arithmetical if it is definable in this structure. 
All decidable relations are arithmetical, as are many others. 
The arithmetical relations can be arranged in a hierarchy; see 
Section 3.5. 

12. Let IK be the structure (R; +, •)• (The language is assumed to have 
equality and the parameters V, +, and •. 93 is the structure whose 
universe is the set R of real numbers and such that +' 3i and are 
the usual addition and multiplication operations.) 

(a) Give a formula that defines in 93 the interval [0, oo). 

(b) Give a formula that defines in 93 the set {2}. 

*(c) Show that any finite union of intervals, the endpoints of which 
are algebraic, is definable in 93. (The converse is also true; these 
are the only definable sets in the structure. But we will not prove 
this fact.) 

13. Prove part (a) of the homomorphism theorem. 
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14. What subsets of the real line R are definable in (R; <) ? What subsets 
of the plane R x R are definable in (R; <)? 

Remarks: The nice thing about (R; <) is that its automorphisms 
are exactly the order-preserving maps from R onto itself. But stop 
after the binary relations. There are 2 13 definable ternary relations, 
so you do not want to catalog all of them. 

15. Show that the addition relation, {(m, n, p)\ p — m + n}, is not 
definablein (N; •)• Suggestion: Consideran automorphism of (N; •) 
that switches two primes. 

Digression: Algebraically, the structure of the natural numbers 
with multiplication is nothing but the free Abelian semigroup with 
Ko generators (viz. the primes), together with a zero element. There 
is no way you could define addition here. If you could define addi¬ 
tion, then you could define ordering (by Exercise 11 and the natural 
transitivity statement). But one generator looks just like another. 
That is, there are 2 K ° automorphisms — simply permute the primes. 
None of them is order-preserving except the identity. 

16. Give a sentence having models of size 2 n for every positive inte¬ 
ger n, but no finite models of odd size. (Here the language should 
include equality and will have whatever parameters you choose.) 
Suggestion: One method is to make a sentence that says, “Every¬ 
thing is either red or blue, and / is a color-reversing permutation.” 

Remark: Given a sentence a, it might have some finite models 
(i.e., models with finite universes). Define the spectrum of a to be 
the set of positive integers n such that a has a model of size n. This 
exercise shows that the set of even numbers is a spectrum. 

For example if a is the conjunction of the field axioms (there 
are only finitely many, so we can take their conjunction), then its 
spectrum is the set of powers of primes. This fact is proved in any 
course on finite fields. The spectrum of —i a, by contrast, is the set 
of all positive integers (non-fields come in all sizes). 

Gunter Asser in 1955 raised the question: Is the complement of 
every spectrum a spectrum? Once you realize that simply taking 
a negation does not work (cf. the preceding paragraph), you see 
that this is a nontrivial question. In fact the problem, known as the 
spectrum problem, is still open. But modern work has tied it to 
another open problem, whether or not co-NP = NP. 

17. (a) Consider a language with equality whose only parameter (aside 

from V) is a two-place predicate symbol P. Show that if 21 is 
finite and 21 = 93, then 21 is isomorphic to 23. Suggestion: 
Suppose the universe of 21 has size n. Make a single sentence a 
of the form 3 V\ ■ ■ • 3 v n 9 that describes 21 “completely.” That 
is, on the one hand, a must be true in 21. And on the other hand, 
any model of o must be exactly like (i.e., isomorphic to) 21. 
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*(b) Show that the result of part (a) holds regardless of what param¬ 
eters the language contains. 

18. A universal (Vi) formula is one of the form Vxi • • • V.r„ 0, where 
6 is quantifier-free. An existential (3 1 ) formula is of the dual form 
3 x\ • • • 3 x n 6. Let 21 be a substructure of <B, and let s : V —> |2l|. 

(a) Show that if |=si i/r [ 5 ] and i/r is existential, then |=rg i/r[s]. And 
if ^[ 5 ] and <p is universal, then |= 2 i (p[s\. 

(b) Conclude that the sentence 3 x Px is not logically equivalent to 
any universal sentence, nor V x Px to any existential sentence. 

Remark: Part (a) says (when (p is a sentence) that any univer¬ 
sal sentence is “preserved under substructures.” Being universal 
is a syntactic property — it has to do with the string of symbols. 
In contrast, being preserved under substructures is a semantic 
property — it has to do with satisfaction in structures. But this 
semantic property captures the syntactic property up to logical 
equivalence (which is all one could ask for). That is, if a is a 
sentence that is always preserved under substructures, then a 
is logically equivalent to a universal sentence. (This fact is due 
to Los and Tarski.) 

19. An 32 formula is one of the form 3x\ ■ ■ ■ 3 x„ 9, where 0 is 
universal. 

(a) Show that if an 32 sentence in a language not containing func¬ 
tion symbols (not even constant symbols) is true in 21 , then it 
is true in some finite substructure of 21 . 

(b) Conclude that V x 3 y Pxy is not logically equivalent to any 32 
sentence. 

20. Assume the language has equality and a two-place predicate symbol 
P. Consider the two structures (PL <) and (R; <) for the language, 
(a) Find a sentence true in one structure and false in the other. 

*(b) Show that any 32 sentence (as defined in the preceding exer¬ 
cise) true in (R; <) is also true in (N; <). Suggestion: First, 
for any finite set of real numbers, there is an automorphism 
of (R; <) taking those real numbers to natural numbers. Sec¬ 
ondly, by Exercise 18, universal formulas are preserved under 
substructures. 

21. We could consider enriching the language by the addition of a new 
quantifier. The formula 3 \x a (read “there exists a unique x such 
that a”) is to be satisfied in 21 by s iff there is one and only one 
a e |2l| such that |=a a[s(.v | a)]. Assume that the language has the 
equality symbol and show that this apparent enrichment comes to 
naught, in the sense that we can find an ordinary formula logically 
equivalent to 3!.r a. 

22. Assume that 21 is a structure and h is a function with ran h = |2t|. 
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Show that there is a structure 23 such that h is a homomorphism of 
18 onto 21. Suggestion: We need to take |*8| = dom/ 7 . In general, 
the axiom of choice will be needed to define the functions in *8, 
unless h is one-to-one. 

Remark: The result yields an “upward Lowenheim-Skolem the¬ 
orem without equality” (cf. Section 2.6). That is, any structure 21 
has an extension to a structure 18 of any higher cardinality such that 
21 and $ are elementarily equivalent, except for equality. There is 
nothing deep about this. Not until you add equality. 

23. Let 21 be a structure and g a one-to-one function with dom g = 12l|. 
Show that there is a unique structure '8 such that g is an isomor¬ 
phism of 21 onto 18. 

24. Let h be an isomorphic embedding of 21 into 18. Show that there is 
a structure £ isomorphic to 18 such that 21 is a substructure of £. 
Suggestion: Let g be a one-to-one function with domain I®! such 
lhatg(/?(fl)) = a for a e 2l|. Form £ such that g is an isomorphism 
of 18 onto £. 

Remark: The result stated in this exercise should not seem sur¬ 
prising. On the contrary, it is one of those statements that is obvious 
until you have to prove it. It says that if you can embed 21 isomor- 
phically into 18, then for all practical purposes you can pretend 21 
is a substructure of 18. 

25. Consider a fixed structure 21. Expand the language by adding a 
new constant symbol c a for each a e |2t|. Let 2l + be the struc¬ 
ture for this expanded language that agrees with 21 on the original 
parameters and that assigns to c a the point a. A relation R on 
|2l| is said to be definable from points in 21 iff R is definable in 
2l + . (This differs from ordinary definability only in that we now 
have parameters in the language for members of |2l|.) Let 93 = 
(R; <, +, •)• 

(a) Show that if A is a subset of R consisting of the union of finitely 
many intervals, then A is definable from points in 93 (cf. Exer¬ 
cise 12). 

(b) Assume that 21 = 93. Show that any subset of |2t| that is non¬ 
empty, bounded (in the ordering <‘ l ), and definable from points 
in 21 has a least upper bound in |2l|. 

Digression: Often when people speak of definability within a 
structure, this is the concept they mean. The more standard phrase 
is “definable from parameters”; here “points” is used because the 
word “parameter” is used in a different sense in this chapter. 

The real ordered field can be characterized up to isomorphism 
by saying that it is a complete ordered field. (This fact should 
be included in any analysis course.) But completeness (i.e., that 
nonempty bounded sets have least upper bounds) is not a first- 
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order property. See Example 4 in Section 4.1 for its second-order 
statement. The first-order “image” of completeness is given by the 
schema obtained from that second-order statement by replacing X 
by a first-order formula (p. The resulting schema (i.e., the set of 
sentences you get by letting <p vary and taking universal closure) 
says that the least-upper-bound property holds for the sets that are 
definable from points. Ordered fields satisfying those sentences are 
called “real closed-ordered fields.” 

The surprising fact is that such fields were not invented by logi¬ 
cians. They were previously studied by algebraists and you can read 
about them in van der Waerden’s Modern Algebra book (volume I). 
Of course, he uses a characterization of them that does not involve 
logic. 

What Tarski showed is that any real closed-ordered field is el¬ 
ementarily equivalent to the field of real numbers. From this it 
follows that the theory of the real-ordered field is decidable, 

26. (a) Consider a fixed structure 21 and define its elementary type to 

be the class of structures elementarily equivalent to 21. Show 
that this class is ECa- Suggestion: Show it is Mod Th2l. 

(b) Call a class /C of structures elementarily closed or ECL if when¬ 
ever a structure belongs to K. then all elementarily equivalent 
structures also belong. Show that any such class is a union of 
ECa classes. (A class that is a union of ECa classes is said to 
be an ECas class; this notation is derived from topology.) 

(c) Conversely, show that any class that is the union of ECa classes 
is elementarily closed. 

27. Assume that the parameters of the language are V and a two-place 
predicate symbol P. List all of the non-isomorphic structures of 
size 2. That is, give a list of structures (where the universe of each 
has size 2) such that any structure of size 2 is isomorphic to exactly 
one structure on the list. 

28. For each of the following pairs of structures, show that they are 
not elementarily equivalent, by giving a sentence true in one and 
false in the other. (The language here contains V and a two-place 
function symbol o.) 

(a) (R; x) and (R*; x*), where x is the usual multiplication op¬ 
eration on the real numbers, R* is the set of non-zero reals, and 
x* is x restricted to the non-zero reals. 

(b) (N; +) and (P; +*), where P is the set of positive integers, and 
+* is usual addition operation restricted to P. 

(c) Better yet, for each of the four structures of parts (a) and (b), 
give a sentence true in that structure and false in the other 
three. 
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SECTION 2.3 _ 

A Parsing Algorithm 1 

As in sentential logic, we need to know that we can decompose formulas 
(and terms) in a unique way, discovering how they are built up. The 
uniqueness is necessary to justify our definitions by recursion, such as 
the definition of satisfaction in the preceding section. 

For terms we used Polish notation; for formulas we relied on paren¬ 
theses. Accordingly, we first consider a decomposition procedure for 
terms, showing unique readability. And then we extend the methods to 
cover the formulas. 

Recall that the terms are built up from the variables and constant 
symbols by operations corresponding to the function symbols. We now 
define a function K on the symbols involved such that for a symbol s, 
K(s) — 1 — n, where n is the number of terms that must follow .y to 
obtain a term: 

K (x) = 1-0=1 for a variable x; 

K(c) — 1 — 0 — 1 for a constant symbol c; 

K (/) = 1 — n for an n-place function symbol /. 

We then extend K to the set of expressions using these symbols by 
defining 

K{s\S 2 ■ ■ ■ s„) — K(si) + K(s 2 ) + ■ ■ ■ + K(s n ). 

Since no symbol is a finite sequence of others, this definition is unam¬ 
biguous. 

Lemma 23A For any term t, K(t) — 1. 

Proof. Use induction on t. The inductive step, for an n-place func¬ 
tion symbol /, is 

K(ft\■ ■ ■ t n ) = (1 — n) + (1 + • • • + 1 ) = 1. 

n times I 

In fact, K was chosen to be the unique function on these symbols 
for which Lemma 23A holds. It follows from this lemma that if e is a 
concatenation of m terms, then K(s) = m. 

By a terminal segment of a string {s 1; ..., s„) of symbols we mean 
a sequence of the form (,s>, s*+i,..., s n ), where 1 < k < n. 

Lemma 23B Any terminal segment of a term is a concatenation of 
one or more terms. 

1 This section may be omitted by a reader willing to accept the meaningfulness of our 
many definitions by recursion. 
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Proof. We use induction on the term. For a one-symbol term (i.e., 
a variable or a constant symbol) the conclusion follows trivially. 
For a term ft\ ■ ■ ■ t„, any terminal segment (other than the term 
itself) must equal 

A+1 ''' A, 

where k < n and t' k is a terminal segment of t k . By the inductive 
hypothesis t' k is a concatenation of, say, m terms, where m > 1. 
So altogether we have m + (n — k ) terms. H 

Corollary 23C No proper initial segment of a term is itself a term. 
If t\ is a proper initial segment of a term f, then K{t\) < 1. 

Proof. Suppose a term t is divided into a proper initial segment t\ 
and a terminal segment to. Then 1 = K(t) = K(t\) + K(to), and 
by Lemma 23B, K(t 2 ) > 1. Hence K(t 1 ) < 1 and t\ cannot be 
a term. H 

Parsing Terms 

We want an algorithm that, given an expression, will determine whether 
or not the expression is a legal term, and if it is, will construct the unique 
tree showing how the term is built up. 

Assume then that we are given an expression. We will construct a 
tree, with the given expression at the top (i.e,, the root). Initially it is 
the only vertex in the tree, but as the procedure progresses, the tree will 
grow downward. 

The algorithm consist of the following two steps. 

1. If each minimal vertex (at the bottom) has a single symbol (which 
must* be a variable or a constant symbol), then the procedure is com¬ 
pleted. (The given expression is indeed a term, and we have constructed 
its tree.) Otherwise, select a minimal vertex having an expression with 
two or more symbols. We examine that expression. 

2. The first symbol must* be an n-place function symbol, say /, 
where n > 0. We extend the tree downward by creating n new vertices 
below the present one. Scan the expression after the /, until first reaching 
a string t (of variables, constant symbols, and function symbols) with 
K(t) — 1 .** Then t is the expression that goes at the leftmost unlabeled 
new vertex. Repeat with the remainder of the expression, until all n new 
vertices have been labeled, and the expression has been exhausted.** 
Return to step 1. 


* If not, then the expression here is not a term. We reject the given expression as a 
non-term, and halt. 

** If the end of the expression is reached before finding such an t , then the expression here 
is not a term. We reject the given expression as a non-term, and halt. 
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As in Section 1.3, the crucial point is that the tree could not have 
been made differently. In step 2, we selected the first string t we found 
with K (t) = 1. We could not have used less than t (because we needed 
K(t) — 1 by Lemma 23 A). We could not have used more than t (because 
that longer string would have the proper initial segment t with K (t) — 1 , 
contradicting Corollary 23C). The choice of t was the only possible 
choice. 

When the algorithm halts, either it has rejected the given expression 
as a non-term, or it has constructed the one tree demonstrating that the 
given expression is a legal tree. 

We can rephrase the uniqueness in the terminology of Section 1.4 as 
follows: 

Unique Readability Theorem for Terms The set of terms is freely 
generated from the set of variables and constant symbols by the 
Tf operations. 

Proof. First, it is clear that iff / ^ g, then ran Tf is disjoint from 
ran T k \ this requires checking only the first symbol. Furthermore, 
both ranges are disjoint from the set of variables and constant 
symbols. It remains only to show that Tf , when restricted to 
terms, is one-to-one. Suppose, for a two-place /, we have 

fhh = fht\. 

By deleting the first symbol we are left with 
hh = hh- 

If t\ ^ ? 3 , then one would be a proper initial segment of the other, 
which is impossible for terms by Corollary 23C. So t\ — h, and 
we are then left with Q = U. H 

Parsing Formulas 

To extend this argument to formulas, we now define K on the other 
symbols: 

*(0 = - i ; 

*0) = i; 

*(V) = -i; 

* (-0 = 0 ; 
ff(—) = -l; 

K(P) = 1-/7 for an »-place predicate symbol P; 

K(=) = - 1. 

The idea behind the definition is again that K ( s ) should be 1 — n, where 
n is the number of things (right parentheses, terms, or formulas) required 
to go after s. We extend K as usual to the set of all expressions: 


K Ovi • ■ ■ s n ) = -F K(s n ). 
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Lemma 23D For any wff a, K(a) — 1. 

Proof. Another straightforward induction. H 

Lemma 23E For any proper initial segment a' of a wff a, K (a') < I. 

Proof. Use induction on a. The details are left to Exercise 1. H 

Corollary 23F No proper initial segment of a formula is itself a 
formula. 

Armed with these facts, we can proceed as in Section 1.3. Instead of 
sentence symbols at the minimal vertices, we now have atomic formulas 
(which are distinguished by having first an n -place predicate symbol, 
followed by n terms). 

A wff that is non-atomic must begin either with V Vj or with (. In 
the former case we have one new vertex; in the latter case we need to 
examine the next symbol to see if it is —i. If not, then we can either count 
parentheses or use the K function — both methods work — to find the 
correct split. 

Again, the uniqueness can be phrased in the terminology of Sec¬ 
tion 1.4 as follows: 

Unique Readability Theorem for Formulas The set of wffs is freely 
generated from the set of atomic formulas by the operations 
£^, £^>, and Q,- (i = 1,2,...). 

Proof. The unary operations £^ and Q, are obviously one-to-one. 
As in Section 1.4, we can show that the restriction of £to wffs 
is one-to-one. 

The disjointness half of the theorem follows from the ad hoc 
observations: 

1. ran £-,, ran Q,, ran Qj , and the set of atomic formulas are 
pairwise disjoint, for i ^ j. (lust look at the first two symbols.) 

2. ran £ ran Qj , ran Q ; , and the set of atomic formulas are 

similarly pairwise disjoint, for i ^ j. 

3. For a wff [$,{—* a) ^ (/l — y), because no wff begins 

with —i. Hence ran £-, is disjoint from the range of the restriction 
of £_♦ to wffs. H 


Exercises 

1. Show that for a proper initial segment a' of a wff a, we have 
K(a') <1. 

2. Let e be an expression consisting of variables, constant symbols, and 
function symbols. Show that e is a term iff K(s) — 1 and for every 
terminal segment s' of s we have K(s') > 0. Suggestion: Prove the 
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stronger result that if K(s') > 0 for every terminal segment s' of s, 
then s is a concatenation of K (e) terms. (This algorithm is due to 
Jaskowski.) 


SECTION 2.4 _ 

A Deductive Calculus 

Suppose that £ |=r. What methods of proof might be required to 
demonstrate that fact? Is there necessarily a proof at all? 

Such questions lead immediately to considerations of what consti¬ 
tutes a proof. A proof is an argument that you give to someone else 
and that completely convinces that person of the correctness of your 
assertion (in this case, that £ |= r). 

Thus a proof should be finitely long, as you cannot give all of an 
infinite object to another person. If the set £ of hypotheses is infinite, 
they cannot all be used. But the compactness theorem for first-order 
logic (which we will prove in Section 2.5 using the deductive calculus 
of this section) will ensure the existence of a finite £ (l C £ such that 
£ohr. 

Another essential feature of a proof (besides its being finite in length) 
is that it must be possible for someone else (if that person is to be 
convinced by it) to check the proof to ascertain that it contains no fal¬ 
lacies. This check must be effective; it must be the sort of thing that 
can be carried out without brilliant flashes of insight on the part of 
the checker. In particular, the set of proofs from the empty set of hy¬ 
potheses (i.e., proofs that (= r) should be decidable. This implies that 
the set of formulas provable without hypotheses must be effectively 
enumerable. For one could in principle enumerate provable sentences 
by generating all strings of symbols and sorting out the proofs from 
the nonproofs. When a proof is discovered, its last line is entered on 
the output list. (This issue will be examined more carefully at the end 
of Section 2.5.) But here again there is a theorem (the enumerability 
theorem, proved in Section 2.5) stating that, under reasonable condi¬ 
tions, the validities — the set of valid formulas — is indeed effectively 
enumerable. 

Thus the compactness theorem and the enumerability theorem are 
necessary conditions for satisfactory proofs of logical implication al¬ 
ways to exist. Conversely, we claim that these two theorems are suffi¬ 
cient for proofs (of some sort) to exist. For suppose that £ |= r. By 
the compactness theorem, then, there is a finite set {<r 0 ,..., er„} C £ 
that logically implies r. Then ero —* • • • —► er„ —► r is valid (Exercise 1, 
Section 2.2). So to demonstrate conclusively that £ |= r one need only 
carry out a finite number of steps in the enumeration of the validities 
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until <7o —*■ ■ ■ ■ —> (J„ —► r appears, and then verify that each a, e £. 
(This should be compared with the complex procedure suggested by the 
original definition of logical implication, discussed in Section 2.2.) The 
record of the enumeration procedure that produced <jq —► • • —► er„ —► r 
can then be regarded as a proof that £ |= r. As a proof, it should be 
acceptable to anyone who accepts the correctness of your procedure for 
enumerating validities. 

Against the foregoing general (and slightly vague) discussion, the 
outline of this section can be described as follows: We will introduce 
formal proofs but we will call them deductions, to avoid confusion 
with our English-language proofs. These will mirror (in our model of 
deductive thought) the proofs made by the working mathematician to 
convince his or her colleagues of certain truths. Then in Section 2.5 we 
will show that whenever £ \= r, there is a deduction of r from £ (and 
only then). This will, as is suggested by the foregoing discussion, yield 
proofs of the compactness theorem and the enumerability theorem. And 
in the process we will get to see what methods of deduction are adequate 
to demonstrate that a given sentence, is, in fact, logically implied by 
certain others. In other words, our goal is to produce a mathematically 
precise concept of deduction that, in the context of first-order logic, is 
adequate and correct. 

Formal Deductions 

We will shortly select an infinite set A of formulas to be called logical 
axioms. And we will have a rule of inference, which will enable us to 
obtain a new formula from certain others. Then for a set F of formulas, 
the theorems of F will be the formulas which can be obtained from 
r U A by use of the rule of inference (some finite number of times). If 
(p is a theorem of F (written F h <p), then a sequence of formulas that 
records (as explained below) how (p was obtained from F U A with the 
rule of inference will be called a deduction of <p from T. 

The choice of A and the choice of the rule (or rules) of inference 
are far from unique. In this section we are presenting one deductive 
calculus for first-order logic, chosen from the array of possible calculi. 
(For example, one can have A = 0 by using many rules of inference. 
We will take the opposite extreme; our set A will be infinite but we will 
have only one rule of inference.) 

Our one rule of inference is traditionally known as modus ponens. It 
is usually stated: From the formulas a and a —► /J we may infer /): 

a, a —► /S 

P ' 

Thus the theorems of a set T are the formulas obtainable from T U A 
by use of modus ponens some finite number of times. 
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Definition. A deduction ofipfromT is a finite sequence (a o,..., a n ) 
of formulas such that a n is ip and for each k <n, either 

(a) oik is in T U A, or 

(b) oik is obtained by modus ponens from two earlier formulas 
in the sequence; that is, for some i and j less than k, a s is a,- -> «/,. 

If such a deduction exists, we say that (p is deducible from T, or that 
ip is a theorem of T, and we write T \- q>. 

There is another viewpoint that is useful here: A deduction of <p from 
T can be viewed as a construction sequence, showing how ip can be 
obtained from the set T U A by applying modus ponens zero or more 
times. (We hesitate to say that tp is built “up” from TUA. Unlike the 
formula-building operations that produce longer formulas from shorter 
ones, modus ponens can produce shorter formulas from longer ones.) 
That is, the set of theorems of F is exactly the set of formulas that are 
obtainable from the “base” set T U A by applying modus ponens. 

The situation here differs from the one discussed in Section 1.4 in two 
ways, one unimportant and one important. The unimportant difference is 
that we get the set of theorems by closing under the “partially defined” 
operation of modus ponens, whose domain consists only of pairs of 
formulas of the form (a, a —> fi) (in contrast to the “totally defined” 
formula-building operations). The more important difference is that the 
set of theorems is not freely generated from T U A by modus ponens. 
This reflects the fact that a theorem never has a unique deduction. In 
Sections 1.3 and 2.3, we were concerned to show that for any wff ip, 
there was a unique tree (which we could effectively calculate) showing 
how ip was built up by use of formula-building operations. Now the tree 
is by no means unique, and calculating one such tree is a very different 
matter from before. 

Nevertheless, this viewpoint does yield the following induction prin¬ 
ciple. We say that a set S of formulas is closed under modus ponens if 
whenever both a e S and (a —> fi) e S then also f> e S. 

Induction Principle Suppose that S is a set of wffs that includes 
TUA and is closed under modus ponens. Then S contains every 
theorem of T. 

For example, if the formulas fi, y, and y —► fi —► a are all in T U A, 
then T F- a, as is evidenced by the tree which displays how a was 

a 


fi 


fi —► a 

/\ 


7 y —*■ fi a 
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obtained. Although it is tempting (and in some ways more elegant) to 
define a deduction to be such a tree, it will be simpler to take deductions 
to be the linear sequences obtained by squashing such trees into straight 
lines. 

Now at last we give the set A of logical axioms. These are arranged 
in six groups. Say that a wff (p is a generalization of xft iff for some 
n > 0 and some variables x\,... ,x n , 

<p = Vxi • • • V x n ftr. 

We include the case n — 0; any wff is a generalization of itself. The 
logical axioms are then all generalizations of wffs of the following 
forms, where x and y are variables and a and ft are wffs: 

1. Tautologies; 

2. Via —► af, where t is substitutable for x in a; 

3. Vx(a —► ft) —► (Via —► V x ft); 

4. a —>Via, where x does not occur free in a. 


And if the language includes equality, then we add 

5. x — x; 

6. x — y —► (a —> a '), where a is atomic and a' is obtained from a 
by replacing x in zero or more (but not necessarily all) places by y. 

For the most part groups 3-6 are self-explanatory; we will see var¬ 
ious examples later. Groups 1 and 2 require explanation. But first we 
should admit that the above list of logical axioms may not appear very 
natural. Later it will be possible to see where each of the six groups 
originated. 

Substitution 

In axiom group 2 we find 

Via —y a'. 

Here a f v is the expression obtained from the formula a by replacing the 
variable x, wherever it occurs free in a, by the term t. This concept can 
also be (and for us officially is) defined by recursion: 


1. For atomic a, aft is the expression obtained from a by replacing 
the variable x by t. (This is elaborated upon in Exercise 1. Note that af 
is itself a formula.) 


2. (-!«)? = (-.«?). 

3. (a — ft)f = (aft - 

4. (V y a)* = f y . ya 


n 

if x — y, 
|Vy(af) if x^y. 
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Examples 

1. <p* = <p. 

2. ( Qx —> Vx Px)* = (Qy -»Vx Px). 

3. If a is —i V y x = y, then V x a —»• af is 

Vx—i V y x = y —*■ V y z = y. 

4. For a as in 3, V x a —► a x is 

Vx —y x = y —*■ —> V y y = y. 

The last example above illustrates a hazard which must be guarded 
against. On the whole, V x a —af seems like a plausible enough axiom. 
(“If a is true of everything, then it should be true of t .”) But in Example 4, 
we have a sentence of the form V x a —* af, which is nearly always false. 
The antecedent, Vx —> V y x = y, is true in any structure whose universe 
contains two or more elements. But the consequent, —i V y y = y, is false 
in any structure. So something has gone wrong. 

The problem is that when y was substituted for x, it was immediately 
“captured” by the V y quantifier. We must impose a restriction on axiom 
group 2 that will preclude this sort of quantifier capture. Informally, we 
can say that a term t is not substitutable for x in a if there is some variable 
y in t that is captured by a V y quantifier in a*. The real definition is 
given below by recursion. (Since the concept will be used later in proofs 
by induction, a recursive definition is actually the most usable variety.) 

Let x be a variable, t a term. We define the phrase “t is substitutable 
for x in a ” as follows: 

1. For atomic a, t is always substitutable for x in a. (There are no 
quantifiers in a, so no capture could occur.) 

2. t is substitutable for x in (-i a) iff it is substitutable for x in a. t 
is substitutable for x in (a —<- /l) iff it is substitutable for x in both a 
and fi. 

3. t is substitutable for x in V y a iff either 

(a) x does not occur free in V y a, or 

(b) y does not occur in t and t is substitutable for x in a. 

(The point here is to be sure that nothing in t will be captured by the 
V v prefix and that nothing has gone wrong inside a earlier.) 

For example, x is always substitutable for itself in any formula. If t 
contains no variables that occur in a , then t is substitutable for x in a. 

The reader is cautioned not to be confused about the choice of words. 
Even if t is not substitutable for x in a, still is obtained from a by 
replacing x wherever it occurs free by t. Thus in forming af, we carry 
out the indicated substitution even if a prudent person would think it 
unwise to do so. 
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Axiom group 2 consists of all generalizations of formulas of the form 
V x a —► a *, 

where the term t is substitutable for the variable x in the formula a. For 
example, 

V Vt,( V V\(Avx —»• V ViAvi) —»• (Av2 —»• V v^Avi)) 

is in axiom group 2. Here x is vu a is Avi —V V 2 AV 2 , and t is u 2 - On 
the other hand, 


V vi V V 2 BV 1 V 2 —* V V 2 BV 2 V 2 

is not in axiom group 2, since V 2 is not substitutable for V\ i 11 V v 2 B v\ v 2 . 

Tautologies 

Axiom group 1 consists of generalizations of formulas to be called 
tautologies. These are the wffs obtainable from tautologies of sentential 
logic (having only the connectives -1 and —►) by replacing each sentence 
symbol by a wff of the first-order language. For example, 

Vx[( Vy -1 Py —* -1 Px) —>■ (Px —► -1 Vy -1 Py)] 

belongs to axiom group 1. It is a generalization of the formula in square 
brackets, which is obtained from a contraposition tautology 

(A —» -1 B) —> (B —> -1 A) 

by replacing A by V y -1 Py and B by Px. 

There is another, more direct, way of looking at axiom group 1. 
Divide the wffs into two groups: 

1. The prime formulas are the atomic formulas and those of the form 
V.r a. 

2. The nonprime formulas are the others, i.e., those of the form —1 a 
or a —► ft. 

Thus any formula is built up from prime formulas by the operations 
£and £_». Now go back to sentential logic, but take the sentence 
symbols to be the prime formulas of our first-order language. Then any 
tautology of sentential logic (that uses only the connectives — 1 , —►) is in 
axiom group 1. There is no need to replace sentence symbols here by 
first-order wffs; they already are first-order wffs. Conversely, anything 
in axiom group 1 is a generalization of a tautology of sentential logic. 
(The proof of this uses Exercise 8 of Section 1.2.) 

Example, revisited. 

(V y —i Py —y -> Px) —* (Px —► -1 Vy-i Py). 
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This has two sentence symbols (prime formulas), Vy ~> Py and Px. 
So its truth table has four lines: 


(V _y -i Py —y -> Px) —> 

T F F T T 

T T T F T 

F T F T T 

F T T F T 


(Px —y -i\/y-i Py) 
T F F T 
F T F T 
T T T F 
F T T F 


From the table we see that it is indeed a tautology. 

On the other hand, neither Vx(Px —>- Px) nor V x Px —> Px is 

a tautology. 

One remark: We have not assumed that our first-order language has 
only countably many formulas. So we are potentially employing an 
extension of Chapter 1 to the case of an uncountable set of sentence 
symbols. 

A second remark: Taking all tautologies as logical axioms is overkill. 
We could get by with much less, at the expense of lengthening deduc¬ 
tions. On the one hand, the tautologies form a nice decidable set (the 
decidability will be important for the enumerability theorem in Sec¬ 
tion 2.5). On the other hand, there is no known fast decision procedure 
for tautologies, as noted in Section 1.7. One option would be to cut 
axiom group 1 down to a set of tautologies for which we do know de¬ 
cision procedures that are fast (the technical term is “polynomial-time 
decidable”). The other tautologies would then be obtainable from these 
by use of modus ponens. 

A third remark: Now that first-order formulas are also wffs of sen¬ 
tential logic, we can apply concepts from both Chapters 1 and 2 to them. 
If T tautologically implies <p, then it follows that T also logically im¬ 
plies ip. (See Exercise 3.) But the converse fails. For example, Vx Px 
logically implies Pc. But Vx Px does not tautologically imply Pc, as 
Vx Px and Pc are two different sentence symbols. 

Theorem 24B T F- (p iff T U A tautologically implies <p. 

Proof. (=F): This depends on the obvious fact that [a., a —»• /)} 
tautologically implies p. Suppose that we have a truth assignment 
v that satisfies every member of T U A. By induction we can see 
that v satisfies any theorem of T. The inductive step uses exactly 
the above-mentioned obvious fact. 

(<(=): Assume that T U A tautologically implies <p. Then by 
the corollary to the compactness theorem (for sentential logic), 
there is a finite subset [yi,.. ., y m , Xi,. .., X n ] that tautologically 
implies (p. Consequently, 

Yi * * * * * Ym * * * * * 7/j * (p 
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is a tautology (cf. Exercise 4 of Section 1.2) and hence is in A. 
By applying modus ponens m + n times to this tautology and to 
{yi, ..., y m , A.i,..., k„] we obtain cp. H 

(The above proof is related to Exercise 7 in Section 1.7. It uses 
sentential compactness for a possibly uncountable language.) 

Deductions and Metatheorems 

We now have completed the description of the set A of logical axioms. 
The set of theorems of a set F is the set built up from TUAby modus 
ponens. For example, 

h Px —► 3 y Py. 

(Here T = 0; we write “h a” in place of “0 h a.”) The formula 
Px —»3 y Py can be obtained by applying modus ponens (once) to two 
members of A, as displayed by the pedigree tree: 

Px —*■ —iVy—iPy 

Vy —*Py — 

(in axiom 

( Vv —i Py — —iPx) -*■ (Px -*■ —iVy—iPy) 

(in axiom group 1) 

We get a deduction of Px —► 3 y Py (from 0) by compressing this 
tree into a linear three-element sequence. 

As a second example, we can obtain a generalization of the formula 
in the first example: 

h Vx(Px —> 3 y Py). 

This fact is evidenced by the following tree, which displays the con¬ 
struction of Vx(Px —► 3 y Py) from A by modus ponens: 




Vx[(Vy —i Py —*■ —i Px) —(Px —iVj>— 'Py)\ 

(in axiom group 1) 



Chapter 2: First-Order Logic 


117 


Again we can compress the tree into a deduction. 

In these examples the pedigree trees may seem to have been pulled 
out of the air. But we will shortly develop techniques for generating 
such trees in a somewhat systematic manner. These techniques will 
rely heavily on the generalization theorem and the deduction theorem 
below. 

Notice that we use the word “theorem” on two different levels. We say 
that a is a theorem of T if T b a. We also make numerous statements 
in English, each called a theorem, such as the one below. It seems 
unlikely that any confusion will arise. The English statements could 
have been labeled metatheorems to emphasize that they are results about 
deductions and theorems. 

The generalization theorem reflects our informal feeling that if we 

can prove_x_without any special assumptions about x, we then are 

entitled to say that “since x was arbitrary, we have Vx_x_.” 

Generalization Theorem If T I- tp and x do not occur free in any 
formula in T, then T h Vx tp. 

Proof. Consider a fixed set T and a variable x not free in T. We 
will show by induction that for any theorem tp of T, we have 
T h V x tp. For this it suffices (by the induction principle) to show 
that the set 


{tp | T h Vx tp] 

includes TU A and is closed under modus ponens. Notice that x 
can occur free in <p. 

Case 1: tp is a logical axiom. Then V x tp is also a logical axiom. 
And so T h Vx tp. 

Case 2: q> e T. Then x does not occur free in tp. Hence 
tp —► V x tp 

is in axiom group 4. Consequently, T h Vx tp. as is evidenced by 
the tree: 


Vx tp 



tp tp —► Vx tp 

(in F) (in axiom group 4) 


Case 3: tp is obtained by modus ponens from i jj and \[r —*■ 
tp. Then by inductive hypothesis we have T h Vx x/t and T h 
Vx(V —► tp). This is just the situation in which axiom group 3 is 
useful. That T h Vx tp is evidenced by the tree: 
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V„v <p 



Vxy> Vxy-*-Vx<p 



Vx(y>—>- 9 ?) Vx(y> — ► «p) — *■ (Vx ip — ► Va 99 ) 

(in axiom group 3) 

So by induction F F V x 95 for every theorem (p of T. H 

(The sole reasons for having axiom groups 3 and 4 are indicated by 
the above proof.) 

The restriction that x not occur free in F is essential. For example, 
Px \/=- V .r Px, and so by the soundness theorem to appear in Section 2.5, 
PxY- V x Px. On the other hand, x will in general occur free in the 
formula cp. For example, at the beginning of this subsection we showed 
first that 


F (Px —y 3 y Py). 

The second example there, 

F \/x(Px —y 3 y Py), 

was obtained from the first example as in case 3 of the above proof. 
Example. Vx V y a F V y V.r a. 

The proof of the generalization theorem actually yields somewhat 
more than was stated. It shows how we can, given a deduction of 9 ? from 
T, effectively transform it to obtain a deduction of \/x cp from IF 

Lemma 24C (Rule T) If T F on, ..., T F ct„ and {a\, ..., a„) tau¬ 
tologically implies ft, then T F /?. 

Proof, ai —► • • • —> a n —► p is a tautology, and hence a logical 
axiom. Apply modus ponens n times. H 

Deduction Theorem If T; y F <p, then F F (y —» tp). 

(The converse clearly holds also; in fact, the converse is essentially 
the rule modus ponens.) 

First Proof 

T; y F <p iff (T; y) U A tautologically implies 9 >, 

iff T U A tautologically implies (y —> ip), 

iff rF(y-» (p). 
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Second Proof The second proof does not use the compactness the¬ 
orem of sentential logic as does the first proof. It shows in a direct 
way how to transform a deduction of q> from T: y to obtain a de¬ 
duction of (y —» <p) from T. We show by induction that for every 
theorem <p of T; y the formula (y —> <p) is a theorem of T. 

Case 1 : (p — y. Then obviously F- (y — <p). 

Case 2: <p is a logical axiom or a member of T. Then T F- (p. 
And <p tautologically implies (y —> q>), whence by rule T we have 
TF(y^ (p). 

Case 3: q> is obtained by modus ponens from i jr and i// —y (p. By 
the inductive hypothesis, F F- (y —*• i//) and TF(y —► (i jr — yq>)). 
And the set {y —> \j/, y — y(pj/ —» q>)} tautologically implies y — up. 
Thus, by rule T, F F (y —> tp). 


So by induction the conclusion holds for any <p deducible from 

F; Y■ H 


Corollary 24D (Contraposition) 


F; (p I-i xjr iff T; 1- up. 


Proof 


r; ^3 I-il/r^rh^ —► —i i/r 

r f- i/ r — > —up 

=>■ T; i/r I - up 


by the deduction theorem, 
by rule T, 
by modus ponens. 


(In the second step we use the fact that <p —> —njr tautologically 
implies \jj —v —> q>.) By symmetry, the converse holds also. H 


Say that a set of formulas is inconsistent iff for some /3, both ft and 
-i ft are theorems of the set. (In this event, any formula a is a theorem 
of the set, since ft —> ft —*-aisa tautology.) 


Corollary 24E (Reductio ad Absurdum) If F; <p is inconsistent, 
then T I- up. 


Proof. From the deduction theorem we have T F- {(p —► ft) and 
rh(f) —► —i ft). And {( p — y ft, <p —^ —■ yS} tautologically implies 
-up. -ft 


Example. \~3x\/y(p — 

There are strategic advantages to working backward. 

It suffices to show that 3 x V y (p F- V y 3 x <p, by the deduction 
theorem. 
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It suffices to show that 3 x V y <p h 3 x (p, by the generalization 
theorem. 

It suffices to show that —i V x —> V y <p I- 1 Vx —i <p, as this is the 

same as the preceding. 

It suffices to show that Vx -i q> h Vx -i Vy <p, by contraposition 
(and rule T). 

It suffices to show that V x -up I- y<p, by generalization. 

It suffices to show that {Vx -up, V y <p} is inconsistent, by reductio 
ad absurdum. 

And this is easy: 

1 . Vx — i <p I - 1 q> by axiom group 2 and modus ponens. 

2. V y <p h <p for the same reason. 

Lines 1 and 2 show that {Vx -up, V y <p) is inconsistent. 

Strategy 

As the preceding example indicates, the generalization and deduction 
theorems (and to a smaller extent the corollaries) will be very useful 
in showing that certain formulas are deducible. But there is still the 
matter of strategy: For a given T and <p, where should one begin in 
order to show that F h <p? One could, in principle, start enumerating 
all finite sequences of wffs until one encountered a deduction of <p 
from T. Although this would be an effective procedure (for reasonable 
languages) for locating a deduction if one exists, it is far too inefficient 
to have more than theoretical interest. 

One technique is to abandon formality and to give in English a proof 
that the truth of T implies the truth of <p. Then the proof in English can 
be formalized into a legal deduction. (In the coming pages we will see 
techniques for carrying out such a formalization in a reasonably natural 
way.) 

There are also useful methods based solely on the syntactical form 
of <p. Assume then that <p is indeed deducible from T but that you are 
seeking a proof of this fact. There are several cases: 

1. Suppose that q> is (xfr —»• 8). Then it will suffice to show that 
T; i/f h 6 (and this will always be possible). 

2. Suppose that <p is Vx x\r. If x does not occur free in T, then it 
will suffice to show that Thf (Even if x should occur free in T, the 
difficulty can be circumvented. There will always be a variable y such 
that T h Vy x/fy and V y xjr* I- Vx i jr. See the re-replacement lemma. 
Exercise 9.) 

3. Finally, suppose that <p is the negation of another formula. 

3a. If <p is —i(i jr — 0), then it will suffice to show that Phi jr and 

T I- 1 8 (by rule T). And this will always be possible. 

3b. If <p is -i —i xfr, then of course it will suffice to show that r h f 
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3c. The remaining case is where <p is —i Vx i jr. It would suffice to 
show that T b —i \[rf, where t is some term substitutable for x in i//. 
(Why?) Unfortunately this is not always possible. There are cases in 
which 


T I-1 V.r i [r, 

and yet for every term f, 

(One such example is T = 0 , x[r — — i (Px —> V y Py).) Contraposition 
is handy here; 


T; V.r I-' a. 

(A variation on this is: F; V y a I- 1 Vx i]/ if F; Vx i/s I- \ a.) If all 

else fails, one can try reductio ad absurdum. 

Example (Q2a). If a does not occur free in a, then 

b (a —y Vx P) <->■ V.r(« —> jS). 

To prove this, it suffices (by rule T) to show, that 

b (a —s- V x P) —+Vx(u —> / 6 ) 

and 


b V x{a —y /3) —y (cr —y V x ji). 

For the first of these, it suffices (by the deduction and generaliza¬ 
tion theorems) to show that 

{(a-yVxP),a} b 0. 

But this is easy; Vx 0 —y f .1 is an axiom. 

To obtain the converse, 

b Vi(q' —y 0) —y (a —y Vx 0), 

it suffices (by the deduction and generalization theorems) to show 

that 


[Vx(a^0),a] b 0. 

This again is easy. 

In the above example we can replace a by —\ a, 0 by —i 0, and use 
the contraposition tautology (among other things) to obtain the related 
fact: 
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(Q3b). If x does not occur free in a, then 

F (3 x ft —* a) Vx(/3 —* a). 

The reader might want to convince him- or herself that the above 
formula is valid. 

Frequently an abbreviated style is useful in writing down a proof of 
deducibility, as in the following example. 

Example (Eq 2). VxVy(x = y->y=i). 

Proof 

1. F x — y —»■ x = x —*■ y = x. Ax 6. 

2. \- x — x. Ax 5. 

3. I- x — y —* y = x. 1, 2; T. 

4. h VxV y(x — y —> y — x). 3; gen 2 . H 

In line 1, “Ax 6” means that the formula belongs to axiom group 6. 
In line 3, “1, 2; T” means that this line is obtained from lines 1 and 2 by 
rule T. In line 4, “3; gen 2 ” means that the generalization theorem can be 
applied twice to line 3 to yield line 4. In the same spirit we write “MP,” 
“ded,” and “RAA” to refer to modus ponens, the deduction theorem, 
and reductio ad absurdum, respectively. 

It must be emphasized that the four numbered lines above do not 
constitute a deduction of V.x V v(a = y —► y — x). Instead they form 
a proof (in the meta-language we continue, with little justification, 
to call English) that such a deduction exists. The shortest deduction 
of V.rV v(x = y —> y = x) known to the author is a sequence of 
17 formulas. 

Example. I- x — y —*• V z Pxz —■► V z Pyz. 

Proof 

1. I- x — y —> Pxz Pyz■ Ax 6. 

2. LV' Pxz —> Pxz. Ax 2. 

3. I- x = y —► Vz Pxz —> Pyz. 1, 2; T. 

4. {x — y, V z Pxz } F Pyz. 3; MP 2 . 

5. {x = y, Vz Pxz] F V z Pyz. 4; gen. 

6 . F .x = y —* Vz Pxz —>• Vz Pyz. 5; ded 2 . H 

Example (Eq 5). Let / be a two-place function symbol. Then 

F V.xi V.x 2 Vyi Vv 2 (.xi = yi —>■ a 2 = yi —► fx\x 2 = fyiyi)- 

Proof. Two members of axiom group 6 are 

x\ = >’i —> fxix 2 = fxix 2 —> fxix 2 — fy\x 2 , 
a 2 = yi —>■ fx ix 2 = fy\x 2 —>• /xix 2 = /yi.y 2 . 
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From Vi x — x (in axiom group 5) we deduce 
fx 1*2 = fx ix 2 . 

The three displayed formulas tautologically imply 

xi = yi —>- x 2 = yi —i- fx\X2 = fyiyi- ^ 

Example 

(a) {Vi(Pi — *■ Qx), V z Pz} F- Qc. It is not hard to show that 
such a deduction exists. The deduction itself consists of seven 
formulas. 

(b) {Vi(Pi —> Qx), Vz Pz} F- Qy. This is just like (a). The 
point we are interested in here is that we can use the same seven- 
element deduction, with c replaced throughout by y. 

(c) (Vi(Pi — *■ Qx),V z Pz} h V y Qy. This follows from 

(b) by the generalization theorem. 

(d) (Vx(Pi — *■ Qx),Vz Pz} h V x Qx. This follows from 

(c) by use of the fact that V v Qy F- V x Qx. 

Parts (a) and (b) of the foregoing example illustrate a sort of inter¬ 
changeability of constant symbols with free variables. This interchange- 
ability is the basis for the following variation on the generalization 
theorem, for which part (c) is an example. Part (d) is covered by Corol¬ 
lary 24G. <p y is, of course, the result of replacing c by y in <p. 

Theorem 24F (Generalization on Constants) Assume that T F- <p 
and that c is a constant symbol that does not occur in T. Then there 
is a variable y (which does not occur in (p ) such that T F- V y <p c y . 
Furthermore, there is a deduction of V y <p y from T in which c 
does not occur. 

Proof. Let (ao, ..., a n ) be a deduction of <p from T. (Thus a n = 
cp.) Let y be the first variable that does not occur in any of the a, ’s. 
We claim that 


((«o ) c y , . ■., (a n ) c y ) (*) 

is a deduction from T of cp c . So we must check that each (a^, is 
in T U A or is obtained from earlier formulas by modus ponens. 

Case 1 \ ctk e T. Then c does not occur in a^. So (ai ( )‘ y = «/ f , 
which is in F. 

Case 2: a k is a logical axiom. Then («/,)'' is also a logical 
axiom. (Read the list of logical axioms and note that introducing 
a new variable will transform a logical axiom into another one.) 

Case 3: a k is obtained by modus ponens from a, and aj (which 
is (a, —► a k )) for i, j less than k. Then (aj) c y = ((a,-)$ —> (a k ) c y ). 
So ( a k ) c y is obtained by modus ponens from (a/) y and ( aj) y . 
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This completes the proof that (*) above is a deduction of <p' y . 
Let <l> be the finite subset of F that is used in (*). Thus (*) is 
a deduction of (p y from <J>, and y does not occur in O. So by 
the generalization theorem, <l> h V v <p' y . Furthermore, there is a 
deduction of V y <p y from <1> in which c does not appear. (For the 
proof to the generalization theorem did not add any new symbols 
to a deduction.) This is also a deduction from T of V y (p c v . ~\ 

We will sometimes want to apply this theorem in circumstances in 
which not just any variable will do. In the following version, there is a 
variable x selected in advance. 

Corollary 24G Assume that F h <p*, where the constant symbol 
c does not occur in T or in q>. Then F h Vn p, and there is a 
deduction of V x <p from F in which c does not occur. 

Proof. By the above theorem we have a deduction (without c) from 
F of V y(((p? : )y), where y does not occur in <p x . But since c does 
not occur in (p, 

(<P X cY y = <P X r 

It remains to show that V y <p* h \/ x (p. We can easily do this if 
we know that 


< yytf ) —*■ V 

is an axiom. That is, x must be substitutable for y in <p x , and 
(c p*) y x must be (p. This is reasonably clear; for details see the 
re-replacement lemma (Exercise 9). H 

Corollary 24H (Rule El) Assume that the constant symbol c does 
not occur in <p, i/r, or F, and that 

r; <p* c h yjr. 

Then 


T; 3 x (p F x/s 

and there is a deduction of i // from T; 3 x <p in which c does not 
occur. 

Proof. By contraposition we have 

T; -njr I- 1 cp*. 

So by the preceding corollary we obtain 
T; — i x/r |- Vx —i (p. 


Applying contraposition again, we have the desired result. H 
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“El” stands for “existential instantiation,” a bit of traditional termi¬ 
nology. 

We will not have occasion to use rule El in any of our proofs, but it 
may be handy in exercises. It is the formal counterpart to the reasoning: 

“We know there is an x such that_ x _. So call it c. Now from_c_ 

we can prove i//.” But notice that rule El does not claim that 3 x <p F- <p*, 
which is in fact usually false. 

Example, revisited. I- 3 xV yep —* V y 3 x <p. 

By the deduction theorem, it suffices to show that 

3 x V y tp F- V y 3 x <p. 

By rule El it suffices to show that 

V y <p* F- Vy 3x<p, 

where c is new to the language. By the generalization theorem it 
suffices to show that 


V y (p x c F- 3 x <p. 

Since V y < p* F- < p x c , it suffices to show that 

(p x c F- 3 a <p. 

By contraposition this is equivalent to 
Vx -up I- up*, 

which is trivial (by axiom group 2 and modus ponens). 

We can now see roughly how our particular list of logical axioms 
was formed. The tautologies were included to handle the sentential 
connective symbols. (We could economize considerably at this point by 
using only some of the tautologies.) Axiom group 2 reflects the intended 
meaning of the quantifier symbol. Then in order to be able to prove the 
generalization theorem we added axiom groups 3 and 4 and arranged 
for generalizations of axioms to be axioms. 

Axiom groups 5 and 6 will turn out to be just enough to prove the 
crucial properties of equality; see the subsection on equality. 

As we will prove in Section 2.5, every logical axiom is a valid for¬ 
mula. It might seem simpler to use as logical axioms the set of all valid 
formulas. But there are two (related) objections to doing this. For one, 
the concept of validity was defined semantically. That is, the definition 
referred to possible meanings (i.e., structures) for the language and to 
the concept of truth in a structure. For our present purposes (e.g., proving 
that the validities are effectively enumerable) we need a class A with 
a finitary, syntactical definition. That is, the definition of A involves 
only matters concerning the arrangement of the symbols in the logical 
axioms; there is no reference to matters of truth in structures. A second 
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objection to the inclusion of all valid formulas as axioms is that we 
prefer a decidable set A, and the set of validities fails to be decidable. 

Alphabetic Variants 

Often when we are discussing a formula such as 
Vx(x ^ 0 —► 3 y x = S_v) 

we are not interested in the particular choice of the variables x and y. 
We want ( x , y) to be a pair of distinct variables, but often it makes no 
difference whether the pair is (14, vg) or (v%, 14}. 

But when it comes time to substitute a term t into a formula, then 
the choice of quantified variables can make the difference between the 
substitutability of t and its failure. In this subsection we will discuss 
what to do when substitutability fails. As will be seen, the difficulty can 
always be surmounted by suitably juggling the quantified variables. 
For example, suppose we want to show that 

h Vx V y Pxy —* V y Pyy. 

There is the difficulty that y is not substitutable for x in V y Pxy, so 
the above sentence is not in axiom group 2. This is a nuisance resulting 
from an unfortunate choice of variables. For example, showing that 

h V .r V z Pxz —► V y Pyy 

involves no such difficulties. So we can solve our original problem if 
we know that 

h V x V y Pxy —>VxVzPxz, 

which, again, involves no difficulties. 

This slightly circuitous strategy (of interpolating V xV z Pxz be¬ 
tween V x V y Pxy and Vv Pyy) is typical of a certain class of prob¬ 
lems. Say that we desire to substitute a term 1 for x in a wff <p. If t is, 
in fact, not so substitutable, then we replace V x q> by V x <p', where t is 
substitutable for x in <p’. In the above example q> is V y Pxy and < 4 / is 

V z Pxz. In general <p’ will differ from cp only in the choice of quantified 
variables. But <p' must be formed in a reasonable way so as to be logi¬ 
cally equivalent to (p. For example, it would be unreasonable to replace 

V y Pxy by V x Pxx, or V y V z Qxyz by V z V z Qxzz. 

Theorem 241 (Existence of Alphabetic Variants) Let <p be a for¬ 
mula, 1 a term, and x a variable. Then we can find a formula <p' 
(which differs from (p only in the choice of quantified variables) 
such that 


(a) <p h tp' and <p' h < p ; 

(b) t is substitutable for x in <p’. 
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Proof. We consider fixed t and x, and construct <p' by recursion on 
<p. The first cases are simple: For atomic <p we take tp' = <p, and 
then (-1 (p) 1 — (-i tp'), (<p —»•i fr)' — (<p' —»• i ]/'). But now consider 
the choice of (V y (p) r . 

If y does not occur in t, or if y — x, then we can just take 
(V y (pY = Vy <p'. But for the general case we must change the 
variable. 

Choose a variable z that does not occur in <p' or t or x. Then 
define (V y q>Y — V z(yp') y - To verify that (b) holds, we note that 
Z does not occur in t and t is substitutable for x in <p' (by the 
inductive hypothesis). Hence (since x ^ z) t is also substitutable 
for x in (<p') y . To verify that (a) holds, we calculate: 

<p P- <p' by the inductive hypothesis; 

.'.Vy tp P- Vy <p'. 

Vy<p' F- (<p') y z since z does not occur in <p’\ 

V y <p' F V z(<p')l by generalization; 

.-.Vy<p F- Vz(<p') y . 

In the other direction, 

V z(yp') y z F- {{(p')l) z y , which is <p r by Exercise 9; 

<p' F- (p\ by the inductive hypothesis; 

:yz{(p') y z F- <p 

.'.V z((p') y F V v <p by generalization. 

The last step uses the fact that y does not occur free in (<p') } 7 unless 
y = z, and so does not occur free in \/ ziyp'Y. in any case. H 

The formulas <p' constructed as in the proof of this theorem will be 
called alphabetic variants of <p. The moral of the theorem is: One should 
not be daunted by failure of substitutability; the right alphabetic variant 
will avoid the difficulty. 

Equality 

We list here (assuming that our language includes —) the facts about 
equality that will be needed in the next section. First, the relation defined 
by V\ — V 2 is reflexive, symmetric, and transitive (i.e., is an equivalence 
relation): 

Eql: F- Vi x = x. 

Proof. Axiom group 5. H 


Eq2: P- V x V y (x — y —► y = x). 
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Proof. Page 122. H 

Eq3: h Vx V y V z(x — y —* y = z — * .v = z). 

Proof. Exercise 11. H 

In addition, we will need to know that equality is compatible with 
the predicate and function symbols: 

Eq4 (for a two-place predicate symbol P ): 

I- V x\ V.r 2 V yi V y 2 (xi — yi -+x 2 = y 2 —> Px\x 2 —* Py\yi). 

Similarly for 77-place predicate symbols. 

Proof. It suffices to show that 

{xi = yi,x 2 = y 2 , Pxix 2 } h Py\y 2 . 

This is obtained by application of modus ponens to the two mem¬ 
bers of axiom group 6 : 

x\ — yi —>• Px ix 2 —> Py\x 2 , 
x 2 = y 2 -+ Py\x 2 —► Py\y 2 . 

Eq5 (for a two-place function symbol /): 

I- Vxi Vx 2 V y i V y 2 (xi = y i —> x 2 — y 2 —* f x\x 2 = fy\y 2 ). 

Similarly for n-place function symbols. 

Proof. Page 122. H 

Final Comments 

A logic book in the bootstrap tradition might well begin with this section 
on a deductive calculus. Such a book would first state the logical axioms 
and the rules of inference and would explain that they are acceptable to 
reasonable people. Then it would proceed to show that many formulas 
were deducible (or deducible from certain nonlogical axioms, such as 
axioms for set theory). 

Our viewpoint is very different. We study, among other things, the 
facts about the procedure described in the preceding paragraph. And 
we employ in this any correct mathematical reasoning, whether or not 
such reasoning is known to have counterparts in the deductive calculus 
under study. 

Figure 8 is intended to illustrate the separation between (a) the level 
at which we carry out our reasoning and prove our results, and (b) the 
level of the deductive calculus which we study. 
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The study, in English, of the view below: 

If then jP 9?. 

F\ a j- fi A —»/? => r ~vx 

^ Vx —iSx 
t=Bx(Px-*- VxPx) 



Figure 8. The meta-language above, in which we study the object 
language below. 


Exercises 

1. For a term u, let u) be the expression obtained from u by replacing 
the variable x by the term t. Restate this definition without using 
any form of the word “replace” or its synonyms. Suggestion: Use 
recursion on u. (Observe that from the new definition it is clear that 
u* is itself a term.) 

2. To which axiom groups, if any, do each of the following formulas 
belong? 

(a) [(Vx Px V y Py ) -► Pz\ -► [Vx Px -*■ (V y Py -+ Pz)l 

(b) Vy[Vx(Px —* Px) —* (Pc — * Pc)]. 

(c) Vx 3 y Pxy —► 3 y Pyy. 

3. (a) Let 21 be a structure and let s : V —> |2l|. Define a truth assign¬ 

ment v on the set of prime formulas by 


v(a) = T iff l=a a[s]. 
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Show that for any formula (prime or not), 

t m = T iff ha Ct[s]. 

Remark'. This result reflects the fact that -i and —»• were treated 
in Chapter 2 the same way as in Chapter 1. 

(b) Conclude that if T tautologically implies <p, then T logically 
implies (p. 

4. Give a deduction (from 0) of V x qi —► 3 x <p. (Note that you should 
not merely prove that such a deduction exists. You are instead asked 
to write out the entire deduction.) 

5. Find a function / such that if a formula <p has a deduction of length 
n from a set F, and if x does not occur free in F, then Vx <p has a 
deduction from T of length f(n). The more slowly your function 
grows, the better. 

6 . (a) Show that if h a —► p. then h Via —*■ Vi /3. 

(b) Show that it is not in general true that a —► ft |= V x a —»• Vx fi. 

7. (a) Show that h 3 x(Px —*■ Vi Px). 

(b) Show that { Qx. V y(Qy —* V z Pz)} F Vi Px. 

8 . (Q2b) Assume that x does not occur free in a. Show that 

h (a —► 3 x ft) 3 x(a —* ft). 

Also show that, under the same assumption, we have Q3a: 
h (V jc /S —* a) <—* 3 x(j3 —>• a). 

9. (Re-replacement lemma) (a) Show by example that {(p*) y x is not in 
general equal to tp. And that it is possible both for x to occur in 
((p*) y x at a place where it does not occur in <p, and for x to occur in 
<p at a place where it does not occur in ( <p x y ) y x . 

(b) Show that if y does not occur at all in (p. then x is substitutable 
for y in <p* and {<Py) y x — (p. Suggestion: Use induction on (p. 

10. Show that 

V x V y Pxy I- V y V x Pyx. 

11. (Eq3) Show that 

h V x V y V z(x = y —>• y = z —► x = z). 

12. Show that any consistent set T of formulas can be extended to a 
consistent set A having the property that for any formula a. either 
a e A or (—1 a) e A. (Assume that the language is countable. Do 
not use the compactness theorem of sentential logic.) 

13. Show that h Py <->• V.r(v = y —> Px). 

Remarks: More generally, if t is substitutable for x in <p and x 
does not occur in t. then 

h [(pf Vx(x — t tp)]. 
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Thus the formula Vx(x = t —* qi) offers an alternative of sorts to 
the substitution cp*. 

14. Show that h (Vx((-i Px) —► Qx ) —> V Qy) —* Py)). 

15. Show that deductions (from 0) of the following formulas exist: 

(a) 3raV3rj8<->3 x(a V f). 

(b) Vx a V Vi ft —► Vx(a V /S). 

16. Show that deductions (from 0) of the following formulas exist: 

(a) 3 x(a A /l) —► 3 x a A 3 x f>. 

(b) Vx(a A yS) <->• Vx a A Vx /l. 

17. Show that deductions (from 0) of the following formulas exist: 

(a) Vx((* —y f) —*■(3x0' —y 3x f). 

(b) 3 x(Py A Qx) Py A 3x 0x. 

SECTION 2.5 _ 

Soundness and Completeness Theorems 

In this section we establish two major theorems: the soundness of our 
deductive calculus (F F- <p =>■ T \= <p) and its completeness (r \= 
ip =>■ F F- <p). We will then be able to draw a number of interesting 
conclusions (including the compactness and enumerability theorems). 
Although our deductive calculus was chosen in a somewhat arbitrary 
way, the significant fact is that some such deductive calculus is sound 
and complete. This should be encouraging to the “working mathemati¬ 
cian” concerned about the existence of proofs from axioms; see the 
Retrospectus subsection of Section 2.6. 

Soundness Theorem If F I- (p, then T |= <p. 

The soundness theorem tells that our deductions lead only to “cor¬ 
rect” conclusions — deductions would be rather pointless otherwise! 
The idea of the proof is that the logical axioms are logically implied by 
anything, and that modus ponens preserves logical implications. 

Lemma 25A Every logical axiom is valid. 

Proof of the Soundness Theorem, Assuming the Lemma. We show 
by induction that any formula <p deducible from T is logically im¬ 
plied by T. 

Case 1: <p is a logical axiom. Then by the lemma (= <p, so a 
fortiori T |= cp. 

Case 2: (p e r. Then clearly T |= <p. 

Case 3: <p is obtained by modus ponens from f and i fr —► < p, 
where (by the inductive hypothesis) T |= \jr and T |= Qf —y <p). 
It then follows at once that Y \= <p. 3 
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It remains, of course, to prove that lemma. We know from Exercise 6 
of Section 2.2 that any generalization of a valid formula is valid. So it 
suffices to consider only logical axioms that are not themselves gener¬ 
alizations of other axioms. We will examine the various axiom groups 
in order of complexity. 

Axiom group 3: See Exercise 3 of Section 2.2. 

Axiom group 4: See Exercise 4 of Section 2.2. 

Axiom group 5: Trivial. 21 satisfies x — x with s iff s(x ) = s(x), 
which is always true. 

Axiom group 1: We know from Exercise 3 of the preceding section 
that if 0 tautologically implies a, then 0 |= a. And that is just what we 
need. 

Axiom group 6 (for an example, see Exercise 5 of Section 2.2): As¬ 
sume that a. is atomic and a' is obtained from a by replacing x at some 
places by y. It suffices to show that 

{x — y, a] \= a!. 

So take any 21, s such that 

ha x = >’[■■>']• i-e., s(x) = s(y). 

Then any term t has the property that if t' is obtained from 1 by replacing 
x at some places by y, then sit) = s(t'). This is obvious; a full proof 
would use induction on t. 

If a is t\ = t 2 , then a' must be t[ = fj, where t\ is obtained from /, as 
described. 

|=a «h] iff sit i) = s(t 2 ), 
iff s(f]) = ihj). 
iff ha a h]- 

Similarly, if a is Pt\ ■ ■ ■ t n , then a' is Pt[ ■ ■ ■ t' n and an analogous 
argument applies. 


Finally, we come to axiom group 2. It will be helpful to consider first 
a simple case: we will show that Vx Px —>- Pt is valid. Assume that 

ha V x Px[s]. 


Then for any d in |2l|, 

ha Px[six | d)]. 

So in particular we may take d — s(r): 

ha^hhh(O)]- (a) 

This is equivalent (by the definition of satisfaction of atomic formulas) 
to 


sit) e P % , 
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which in turn is equivalent to 

l= 2 i Pt[s]. (b) 

For this argument to be applicable to the nonatomic case, we need a 
way of passing from (a) to (b). This will be provided by the substitution 
lemma below, which states that 

iff NatfM 

whenever t is substitutable for x in <p. 

Consider a fixed 21 and .v. For any term u, let u x be the result of 
replacing the variable x in u by the term t. 

Lemma 25B s(u x ) = s(x | s(t))(u). 

This looks more complicated than it is. It asserts that a substitution 
can be carried out either in the term u or in s, with equivalent results. 
The corresponding commutative diagram is shown. 

substitution 

terms-► terms 

\ of t for 

j(x|5(0) \ 

1*1 

Proof. By induction on the term u. If u is a constant symbol or 
a variable other than x, then u x t = u and the desired equation 
reduces to s(u) = s ( u ). If u — x, then the equation reduces to 
s(t) = J(t). The inductive step, although cumbersome to write, 
is mathematically trivial. H 

The substitution lemma is similar in spirit; it states that a substitution 
can be carried out either within <p or in s, with equivalent results. For 
an example see Exercise 10 of Section 2.2. 

Substitution Lemma If the term t is substitutable for the variable x 
in the wff <p, then 

hoi ^ f x [s] iff 1=21 <p[s{x | J(0)]- 

Proof. We use induction on <p to show that the above holds for 
every .s. 

Case 1: <p is atomic. Then the conclusion follows from the 
preceding lemma. For example, if q> is Pit for some term u, then 

|= a Pu x [s] iff s{u x t ) e P % , 

iff s(x | J(t))(u) e P 21 by Lemma 25B, 
iff |= 2 , Pu[s(x\s(t))]- 
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Case 2: ip is -i ip or ip —>► 9. Then the conclusion for <p follows 
at once from the inductive hypotheses for i p and 0 . 

Case 3: ip is V y ip, and x does not occur free in <p. Then 5 and 
s(x | s(t )) agree on all variables that occur free in q>. And also ip? 
is simply (p. So the conclusion is immediate. 

Case 4: q> is Vv ip, and x does occur free in q>. Because t is 
substitutable for x in ip, we know that y does not occur in t and t 
is substitutable for x in ip (see the definition of “substitutable”). 
By the first of these, 

s(t) = s(y | d)(t) (*) 

for any d in |2l|. Since x ^ y,<pf = V y ip?. 

|= 2 i (p?[s] iff for every d, |= a ip?[s(y \ d)], 

iff for every d, \=% ip[s(y \ d)(x \ s(f))] by 
the inductive hypothesis and (*), 
iff l=a <p[s(x |i(f))]. 

So by induction the lemma holds for all <p. H 

Axiom group 2: Assume that t is substitutable for x in q>. Assume 
that 21 satisfies V x <p withWe need to show that <pf [.?]. We know 
that for any d in |2l|, 

Na I d)]. 

In particular, let d = s(t): 

Na <P\s(x |s(f))], 

So, by the substitution lemma, 

Na (ft t'^]' 

Hence V x <p —> ip? is valid. 

This completes the proof that all logical axioms are valid. And so the 
soundness theorem is proved. 

Corollary 25C If I-(<£><—> ip), then cp and ip are logically equiva¬ 
lent. 

Corollary 25D If <p' is an alphabetic variant of ip (see Theo¬ 
rem 241), then <p and ip' are logically equivalent. 

Recall that a set T is consistent iff there is no formula ip such that 

both T h ip and T I- up. Define T to be satisfiable iff there is some 21 

and s such that 21 satisfies every member of T with s. 

Corollary 25E If T is satisfiable, then T is consistent. 

This corollary is actually equivalent to the soundness theorem, as the 
reader is invited to verify. 
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The completeness theorem is the converse to the soundness theorem 
and is a deeper result. 

Completeness Theorem (Godel, 1930) 

(a) If T |\= <p, then TFf 

(b) Any consistent set of formulas is satisfiable. 

Actually parts (a) and (b) are equivalent; cf. Exercise 2. So it suffices 
to prove part (b). We will give a proof for a countable language; later 
we will indicate what alterations are needed for languages of larger 
cardinality. (A countable language is one with countably many symbols, 
or equivalently, by Theorem OB, one with countably many wffs.) 

The ideas of the proof are related to those in the proof of the com¬ 
pactness theorem for sentential logic. We begin with a consistent set T. 
In steps 1-3 we extend T to a set A of formulas for which 

(i) r C A. 

(ii) A is consistent and is maximal in the sense that for any formula 
a, either a e A or (—i a) e A . 

(iii) For any formula (p and variable x, there is a constant c such that 

(—- Vx <tp —► -xp*) e A. 

Then in step 4 we form a structure 21 in which members of T not 
containing = can be satisfied. |2l| is the set of terms, and for a predicate 
symbol P, 

{h, e P % iff Pt\ ■ ■ ■ t„ e A. 

Finally, in steps 5 and 6 we change 21 to accommodate formulas con¬ 
taining the equality symbol. 

It is suggested that on a first reading the details which are provided 
for most of the steps be omitted. Once the outline is clearly in mind, the 
entire proof should be read. (The nondetails are marked with a stripe in 
the left margin.) 

Proof. Let T be a consistent set of wffs in a countable language. 

H 

Step 1 : Expand the language by adding a countably infinite set of 
new constant symbols. Then T remains consistent as a set of wffs 
in the new language. 

Details: If not then for some (i, there is a deduction (in the expanded 
language) of (/3 A -i /3) from T. This deduction contains only finitely 
many of the new constant symbols. By the theorem for generalization 
on constants (Theorem 24F), each can be replaced by a variable. We 
then have a deduction (in the original language) of (/3' A -i /S') from T. 
This contradicts our assumption that T was consistent. 
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Step 2: For each wff <p (in the new language) and each variable x, 
we want to add to T the wff 

where c is one of the new constant symbols. (The idea is that c 
volunteers to name a counterexample to q>, if there is any.) We 
can do this in such a way that T together with the set 0 of all the 
added wffs is still a consistent set. 

Details: Adopt a fixed enumeration of the pairs (<p, x), where (p is a 
wff (of the expanded language) and x is a variable: 


{(P2,X 2 ), {(p3,X 2 ), .... 


This is possible since the language is countable. Let 9\ be 

—► ~><pi 

where c i is the first of the new constant symbols not occurring in <p\. 
Then go on to (pp 2 , x 2 ) and define 0 2 . In general, 9„ is 

1 ^ Xn^Pn * 1 (Pn Cn . 

where c„ is the first of the new constant symbols not occurring in tp n or 
in 9k for any k < n. 

Let © be the set {9\ ,0 2 ,...}. We claim that F U © is consistent. If 
not, then (because deductions are finite) for some m > 0, 

T U { 6 >i,..., 9 m ,9 m+ 1 } 

is inconsistent. Take the least such m. Then by RAA 
T U [9i ,...., 9 m } \ * 9 m+ 1 . 


Now 9 m+ 1 is 


-iVjc <p —> -i y x c 

for some x, (p, and c. So by rule T, we obtain the two facts: 

F U {0i, .. ,9 m } I iVr <p, 
ru {9 u ...9 m }\-<p* c . W 

Since c does not appear in any formula on the left side, we can apply 
the Corollary 24G to the second of these, obtaining 

T U {0i,..., 9 m } h Vx <p. 


This and (*) contradict the leastness of m (or the consistency of T, if 

m — 0 ). 
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Step 3: We now extend the consistent set F U © to a consistent set 
A which is maximal in the sense that for any wff cp either <p e A 
or (-i cp) e A. 

Details: We can imitate the proof used at the analogous place in 
the proof of sentential compactness in Section 1.7. Or we can argue as 
follows: Let A be the set of logical axioms for the expanded language. 
Since F U © is consistent, there is no formula P such that F U 0 U A 
tautologically implies both ft and -i /3. (This is by Theorem 24B; the 
compactness theorem of sentential logic is used here.) Hence there is 
a truth assignment v for the set of all prime formulas that satisfies 
TU©U A. Let 


A = {(p\ v((p) = T}. 

Clearly for any ip either e A or (— \<p) & A but not both. Also we 
have 

Ah A tautologically implies tp (since A C A), 

=)> v((p) — T since v satisfies A, 

=>■ <p e A. 

Consequently, A is consistent, lest both q> and ( -i <p) belong to A. 

Actually, regardless of how A is constructed, it must be de¬ 
ductively closed. That is, 

A h ^ AK - \<p by consistency, 

a, 

=>■ <p e A by maximality. 

Step 4: We now make from A a structure 21 for the new language, 
but with the equality symbol (if any) replaced by a new two-place 
predicate symbol E. 21 will not itself be the structure in which T 
will be satisfied but will be a preliminary structure. 

(a) 1211 = the set of all terms of the new language. 

(b) Define the binary relation Zs 21 by 

(u, t) e L 21 iff the formula u — t belongs to A. 

(c) For each /(-place predicate parameter P, define the n-ary 
relation P' [ by 

(h . t n ) e P % iff Pti---t n e A. 

(d) For each //-place function symbol /, let / 2l be the function 
defined by 
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This includes the n — 0 case; for a constant symbol c we take c a = c. 
Define also a function s : V -> 2l|, namely the identity function 
.?(■*) = x on V. 

It then follows that for any term f, s(t) = t. For any wff tp , let <p * 
be the result of replacing the equality symbol in ^ by E. Then 

iff 

Details: That ~s(t) — t can be proved by induction on t, but the proof 
is entirely straightforward. 

The other claim, that 

ha <?*[>] iff e A, 

we prove by induction on the number of places at which connective or 
quantifier symbols appear in (p. 

Case 1: Atomic formulas. We defined 21 in such a way as to make 
this case immediate. For example, if <p is Pt, then 

ha Ptb] iff s(t) e P a , 
iff t e P % , 
iff Pt e A. 

Similarly, 

ha uEt[s] iff (s(u), s(t)) e Zs 21 , 
iff (, u,t) e E % , 
iff u — t e A. 

Case 2: Negation. 

ha(-h>*h] if ha?>*M, 

iff <p A by inductive hypothesis, 
iff (—i cp) e A by properties of A. 

Case 3: Conditional. 

ha(«e —iff ha^Mor ha 

iff q> A or \jr e A by inductive hypothesis, 
iff (—i (p) e A or i/r e A, 

=>■ Ah (cp —ye), in fact tautologically, 

=p <p ^ A or [<p e A and A h \//], 

=p (—i (p) e A or r/r e A, 

which closes the loop. And 

Ah {up —* ijr) iff {(p —>• \jr) e A. 

(This should be compared with Exercise 2 of Section 1.7.) 

Case 4: Quantification. We want to show that 


ha Vi/[j] iff Vxijj 6 A. 
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(The notational ambiguity is harmless since Vx (<£>*) is the same as 
(Vx <p )*.) A includes the wff 9: 

-i V x (p —► -i <p x c . 

To show that 

l= 2 i Vx </>*[s] =>- Vx <p e A, 

we can argue: If <p* is true of everything, then it is true of c, whence by 
the inductive hypothesis (pf. e A. But then Vx (p e A, because c was 
chosen to be a counterexample to q> if there was one. In more detail: 

ha Vx <?*[.?] =>• ha <P*b(x | c)] 

=> |=a (<P*)*[.j] by the substitution lemma 
=>■ ha (#)*[*], this being the same formula 
=?■ (p* e A by the inductive hypothesis 

=> (“i cpf.) ^ A by consistency 
=> hVxp) ^ A since 9 e A and A is 
deductively closed 

=tVxi?e A. 

(This is our only use of ©. We needed to know that if (—i Vx cp) e A, 
then for a particular c we would have (—i <p £) e A.) 

We turn now to the converse. We can almost argue as follows: 

ha Vx <p*|>] => ha V *[•'>’(x | f)] for some / 

ha by the substitution lemma 

=>■ ^ A by the inductive hypothesis 

Vx cp A since A is deductively closed. 

The flaw here is that the two wavy implications require that t be 
substitutable for x in (p. This may not be the case, but we can use the 
usual repair: We change to an alphabetic variant xjf of <p in which t is 
substitutable for x. Then 

ha Vx <p*[s] =>■ ha <p*h(x | f)] for some t, henceforth fixed 

=>■ ha is*[six 1 1 )] by the semantical equivalence of 

alphabetic variants (Corollary 25D) 
=>■ ha (V'/hh] by the substitution lemma 
=>■ iff A by the inductive hypothesis 

=>■ V x A since A is deductively closed 

=F V x <p A by the syntactical equivalence of 

alphabetic variants (Theorem 241). 

This completes the list of possible cases; it now follows by induction 
that for any <p. 


ha iff A. 
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If our original language did not include the equality symbol, then 
we are done. For we need only restrict 21 to the original language to 
obtain a structure that satisfies every member of T with the identity 
function. 

But now assume that the equality symbol is in the language. Then 
21 will no longer serve. For example, if T contains the sentence 
c — d (where c and d are distinct constant symbols), then we need a 
structure 25 in which c® = <Z®. We obtain IB as the quotient structure 
21/ E of 21 modulo Zs 21 . 

Step 5: Zs 21 is an equivalence relation on |2l|. For each t in |2l| let 
[r] be its equivalence class. Zs 21 is, in fact, a congruence relation 
for 21. This means that the following conditions are met: 

(i) Zs 21 is an equivalence relation on |2l|. 

(ii) P' 1 is compatible with Z? 21 for each predicate symbol P: 

(q,..., t„) e P a and t, E % t\ for 1 < i < n => {t[,, t’ n ) e P a . 

(iii) / 21 is compatible with Z? 21 for each function symbol /: 

tiE*tl for 1 </<«=> / a (/i, • ■ •, t n )E*f*(t[, ..., Q. 

Under these circumstances we can form the quotient structure 21/ E, 
defined as follows: 

(a) |2l/Zs| is the set of all equivalence classes of members of |2l|. 

(b) For each n -place predicate symbol P, 

([fi],...,[t„]) e P* /E iff (tu...,tn) e P 21 . 

(c) For each n -place function symbol /, 

f* /E {[h],...,[tn\)=[f % {h,...,tn)]. 

This includes the n — 0 cases: 

c % ' E = [c a ]. 

Let h : |2l| —> |2l/Zs| be the natural map: 

hit) = [t]. 

Then h is a homomorphism of 21 onto 21 / E. Furthermore, E'~ l/E 
is the equality relation on |2l/Zs|. Consequently, for any <p\ 

(p e A ^ 2 i 

Na/£ <P*[h o s] 

O I=a/£ <p[h 

So 21/ E satisfies every member of A (and hence every member of 
T) with h o s. 
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Details: Recall that 

tE m t' iff (r = f')e A, 
iff Ah t = t'. 

(i) E a is an equivalence relation on 21 by properties Eql, Eq2, and 
Eq3 of equality. 

(ii) is compatible with E a by property Eq4 of equality. 

(iii) f' 21 is compatible with E a by property Eq5 of equality. 

It then follows from the compatibility of P 21 with E a that P^ E is 
well defined. Similarly, f' l ^ E is well defined because / a is compatible 
with E a . 

It is immediate from the construction that h is a homomorphism of 
21 onto 21/E. And 

[t]E^ E [P] iff fE a f', 

iff [t] = [t r ]. 

Finally, 

(p e A <s> |= a ip*[i] by step 4 

\=<h/e <p*[h o s] by the homomorphism theorem 

<$■ ha/£ ° J ]> 

the last step being justified by the fact that E^ e is the equality relation 
on |2l/E|. 

Step 6: Restrict the structure 21/E to the original language. This 
restriction of 21/E satisfies every member of T with h o s. H 

For an uncountable language, a few modifications to the foregoing 
proof of the completeness theorem are needed. Say that the language has 
cardinality X. (By this we mean that it has X symbols or, equivalently, 
X formulas.) We will describe the modifications needed, assuming the 
reader has a substantial knowledge of set theory. In step 1 we add X 
new constant symbols; the details remain unchanged. In step 2, only the 
details change. The cardinal X is an initial ordinal. (We have tacitly well 
ordered the language here.) “Enumerate” the pairs 


{(Pcti Xa)a<X 

indexed by ordinals less than X. For a < X, 0 a is 
1 V x a cp a * (“i i 

where c a is the first of the new constant symbols not in <p a or in dp for 
any /S < a. (This excludes at most Ro • card(a) constant symbols, so 
there are some left.) Finally, in step 3, we can obtain the maximal set A 
by use of Zorn’s lemma. The rest of the proof remains unchanged. 
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Compactness Theorem (a) If T |= <p, then for some finite To c V 
we have To |= <p. 

(b) If every finite subset To of T is satisfiable, then V is satis- 
fiable. 

In particular, a set E of sentences has a model iff every finite subset 
has a model. 

Proof To prove part (a) of the compactness theorem, we simply 
observe that 

r |= q> => r i- <p 

=>• To h (p for some finite To c r, deductions being finite 

=> ro N v- 

Part (b) has a similar proof. If every finite subset of F is sat¬ 
isfiable, then by soundness every finite subset of V is consistent. 
Thus T is consistent, since deductions are finite. So by complete¬ 
ness, F is satisfiable. (Actually parts (a) and (b) are equivalent; 
cf. Exercise 3 of Section 1.7.) H 

When a person first hears of the compactness theorem, his natural 
inclination is to try to combine (by some algebraic or set-theoretic op¬ 
eration) the structures in which the various finite subsets are satisfied, 
in such a way as to obtain a structure in which the entire set is satisfied. 
In fact, such a proof is possible; the operation to use is the ultraprod¬ 
uct construction. But we will refrain from digressing further into this 
intriguing possibility. 

Notice that the compactness theorem involves only semantical no¬ 
tions of Section 2.2; it does not involve deductions at all. And there are 
proofs that avoid deductions. The same remarks apply to the following 
theorem. 

* Enumerability Theorem For a reasonable language, the set of valid 
wffs can be effectively enumerated. 

By a reasonable language we mean one whose set of parameters can 
be effectively enumerated and such that the two relations 

{{P, n) | P is an n -place predicate symbol} 

and 


{(/, n ) I / is an n-place function symbol} 

are decidable. For example, any language with only finitely many pa¬ 
rameters (such a language will be called a finite language ) is certainly 
reasonable, because finite sets are always decidable. On the other hand, 
a reasonable language must be countable, since we cannot effectively 
enumerate an uncountable set. (In fact, a stronger statement applies: As 



Chapter 2: First-Order Logic 


143 


in Section 1.7, a suitable input/output format is needed in which the 
underlying set of symbols communicated is finite, which implies that 
the set of all strings is countable.) 

A precise version of this theorem will be given in Section 3.4. (See 
especially item 20 there.) The proofs of the two versions are in essence 
the same. 

Proof. The essential fact is that A, and hence the set of deductions, 
is decidable. 

Suppose that we are given some expression s. (The assumption 
of reasonableness enters already here. There are only countably 
many things eligible to be given by one person to another.) We 
want to decide whether or not s is in A. First we check that 
s has the syntactical form necessary to be a formula. (For sen¬ 
tential logic we gave detailed instructions for such a check; see 
Section 1.3. Similar instructions can be given for first-order lan¬ 
guages, by using Section 2.3.) If s passes that test, we then check 
(by constructing a truth table) to see if s is a generalization of a 
tautology. If not, then we proceed to see if s has the syntactical 
form necessary to be in axiom group 2. And so forth. If s has not 
been accepted by the time we finish with axiom group 6, then s 
is not in A. 

(The above is intended to convince the reader that he really can 
tell members of A from nonmembers. The reader who remains 
dubious can look forward to the rerun in Section 3.4.) 

Since A is decidable, the set of tautological consequences of 
A is effectively enumerable; see Theorem 17G. But 

{a | a is a tautological consequence of A} 

= {a | F- a} by Theorem 24B, 

= {a | a is valid}. _l 


An alternative to the last paragraph of this proof is the following 
argument, which is possibly more illuminating; First we claim that the 
set of deductions (from 0) is decidable. For given a finite sequence 
ao,... ,a„ we can examine each a, in turn to see if it is in A or is 
obtainable by modus ponens from earlier members of the sequence. 
Then to enumerate the validities, we begin by enumerating all finite 
sequences of wffs. We look at each sequence as it is produced and 
decide whether or not it is a deduction. If not, we discard it. But if it is, 
then we put its last member on the list of validities. Continuing in this 
way, we generate — in an inefficient way — a list on which any valid 
formula will eventually appear. 

*Corollary 25F Let T be a decidable set of formulas in a reason¬ 
able language. 
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(a) The set of theorems of T is effectively enumerable. 

(b) The set {cp | T |= ^ 5 } of formulas logically implied by T is 
effectively enumerable. 

(Of course parts (a) and (b) refer to the same set. This corollary 
includes the enumerability theorem itself, in which F = 0.) 

Proof 1 . Enumerate the validities; whenever you find one of the 
form 


—►-► oil —► <*o> 

check to see if a n ,..., a\ are in T. If so, then put «o on the list of 
theorems of T. In this way, any theorem of T is eventually listed. 

H 


Proof 2 . T U A is decidable, so its set of tautological consequences 
is effectively enumerable. And that is just the set we want. H 

For example, let T be the (decidable) set of axioms for any of the 
usual systems of set theory. Then this corollary tells us that the set of 
theorems of set theory is effectively enumerable. 

More generally, in setting up some axiomatic theory, it is natural 
to insist that the set of axioms be decidable. After all, we want proofs 
from these axioms to be convincing arguments that can be verified. Part 
of the verification process involves checking that statements alleged 
to be axioms are indeed axioms. For this to be possible, the set of 
axioms needs to be decidable (or at least semidecidable). This has the 
consequence that the set of theorems that follow from the axioms is 
effectively enumerable. 

*Corollary 25G Assume that T is a decidable set of formulas in 
a reasonable language, and for any sentence cr either F |= 0 or 
F ^ -id. Then the set of sentences implied by I' is decidable. 

Proof. If F is inconsistent, then we have simply the (decidable) 
set of all sentences. So assume that T is consistent. Suppose 
that we are given a sentence cr and asked to decide whether or 
not T |= cr. We can enumerate the theorems of I' and look for 
cr or 1 cr. Eventually one will appear, and then we know the 
answer. H 

(Observe that this proof actually describes two decision procedures. 
One is correct when F is inconsistent, the other is correct when I' is con¬ 
sistent. So in either case a decision procedure exists. But we cannot nec¬ 
essarily determine effectively, given a finite description of T, which one 
is to be used. A set is decidable if there exists a decision procedure for it. 
That is not the same as having a known decision procedure in our hands.) 
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It should be remarked that our proofs of enumerability cannot, in 
general, be strengthened to proofs of decidability. For almost all lan¬ 
guages the set of validities is not decidable. (See Church’s Theorem, 
Section 3.5.) 

Historical Notes 

The completeness theorem (for countable languages) was contained in 
the 1930 doctoral dissertation of Kurt Godel. (It is not to be confused 
with the “Godel incompleteness theorem,’’ published in 1931. We will 
consider this latter result in Chapter 3.) The compactness theorem (for 
countable languages) was given as a corollary. 

The compactness theorem for uncountable languages was implicit 
in a 1936 paper by Anatolii Mal'cev. His proof used Skolem functions 
(cf. Section 4.2) and the compactness theorem of sentential logic. The 
first explicit statement of the compactness theorem for uncountable 
languages was in a 1941 paper by Mal'cev. 

The enumerability theorem, as well as following from Godel’s 1930 
work, was also implicit in results published in 1928 by Thoralf 
Skolem. 

The proof we have given for the completeness theorem is patterned 
after one given by Leon Henkin in his dissertation, published in 1949. 
Unlike Godel’s original proof, Henkin’s proof generalizes easily to lan¬ 
guages of any cardinality. 


Exercises 

1. (Semantical rule El) Assume that the constant symbol c does not 
occur in (p , i/r, or F, and that T; (p* \= xj/. Show (without using the 
soundness and completeness theorems) that T; 3 x cp |= xjr. 

2. Prove the equivalence of parts (a) and (b) of the completeness the¬ 
orem. Suggestion: F |= <p iff T U {—i q>] is unsatisfiable, And A 
is satisfiable iff A ^ _L, where _L is some unsatisfiable, refutable 
formula like —i V jc x = x. 

Remark: Similarly, the soundness theorem is equivalent to the 
statement that every satisfiable set of formulas is consistent. 

3. Assume that T F- (p and that P is a predicate symbol which occurs 
neither in F nor in <p. Is there a deduction of <p from F in which 
P nowhere occurs? Suggestion: There are two very different ap¬ 
proaches to this problem. The “soft” approach makes use of two 
languages, one that contains P and one that does not. The “hard” 
approach considers the question whether P can be systematically 
eliminated from a given deduction of q> from F. 
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4. Let r = {—i V V\Pv\, Pv 2 , Pv 3 ,.. .}• Is F consistent? Is V satisfi- 
able? 

5. Show that an infinite map can be colored with four colors iff every 
finite submap of it can be. Suggestion: Take a language having a 
constant symbol for each country and having four one-place predicate 
symbols for the colors. Use the compactness theorem. 

6 . Let Ei and E 2 be sets of sentences such that nothing is a model 
of both Ei and E 2 . Show that there is a sentence r such that 
Mod Ei C Mod r and Mod E 2 c Mod —1 r. (This can be stated: 
Disjoint ECa classes can be separated by an EC class.) Suggestion: 
E[ U E 2 is unsatisfiable; apply compactness. 

7. The completeness theorem tells us that each sentence either has a 
deduction (from 0) or has a counter-model (i.e., a structure in which 
it is false). For each of the following sentences, either show there is 
a deduction or give a counter-model. 

(a) V.r(g.r —► Vy Qy) 

(b) (3 x P.x —> V y Qy) —> V z{Pz —>• Qz) 

(c) V z(Pz —* Qz) —► (3 x Px —> V >’ gy) 

(d) —1 3 yVx(Pxy -1 Pxx) 

8 . Assume the language (with equality) has just the parameters V and 
P, where P is a two-place predicate symbol. Let 21 be the structure 
with |2l| = Z, the set of integers (positive, negative, and zero) and 
with (a, b) e iff \a — h\ — 1. Thus 21 looks like an infinite 
graph: 


Show that there is an elementarily equivalent structure 23 that is not 
connected. (Being connected means that for every two members of 
1231, there is a path between them. A path — of length n —from a 
to b is a sequence (po, p\, ..., p n ) with a — po and b = p n and 
{Pi,Pi+ 1 > e for each i.) Suggestion: Add constant symbols c 
and d. Write down sentences saying c and d are far apart. Apply 
compactness. 

9. In Section 2.4 we used a certain set A of logical axioms. That set 
can be altered, within limits. 

(a) Suppose we add to A some formula that is not valid. Show 
that the soundness theorem now fails. 

(b) At the other extreme, suppose we take no logical axioms at all: 
A — 0. Show that the completeness theorem now fails. 

(c) Suppose we modify A by adding one new valid formula. Explain 
why both the soundness theorem and the completeness theorem 
still hold. 
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SECTION 2.6 _ 

Models of Theories 

In this section we will leave behind deductions and logical axioms. 
Instead we return to topics discussed in Section 2.2. But now, in the 
presence of the theorems of the preceding section, we will be able to 
answer more questions than we could before. 

Finite Models 

Some sentences have only infinite models, for example, the sentence 
saying that < is an ordering with no largest element. The negation of 
such a sentence is finitely valid , that is, it is true in every finite structure. 

It is also possible to have sentences having only finite models. For 
example, any model of V.v V y x = y has cardinality 1. But if all models 
of £ are finite, then there is a finite bound on the size of the models, by 
the following theorem. 

Theorem 26A If a set £ of sentences has arbitrarily large finite 
models, then it has an infinite model. 

Proof. For each integer k > 2, we can find a sentence A* that 
translates, “There are at least k things.” For example, 

A 2 = 3 v\ 3 v 2 v x ^ v 2 , 

X 2 — 3 v\ 3 v 2 3 v 2 (v\ / A t»i ^ 1)3 A ^ D 3 ). 

Consider the set 


£ U {A 2 , A 3 , ...}. 

By hypothesis any finite subset has a model. So by compactness 
the entire set has a model, which clearly must be infinite. H 

For example, it is a priori conceivable that there might be some very 
subtle equation of group theory that was true in every finite group but 
false in every infinite group. But by the above theorem, no such equation 
exists. 

The proof to this theorem illustrates a useful method for obtaining a 
structure with given properties. One writes down sentences (possibly in 
an expanded language) stating the properties one wants. One then argues 
that any finite subset of the sentences has a model. The compactness 
theorem does the rest. We will see more examples of this method in the 
coming pages. 

Corollary 26B The class of all finite structures (for a fixed lan¬ 
guage) is not ECa- The class of all infinite structures is not EC. 
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Proof. The first sentence follows immediately from the theorem. 
If the class of all infinite structures is Mod r, then the class of all 
finite structures is Mod -i r. But this class isn’t even EC a, much 
less EC. H 

The class of infinite structures is ECa, being Modj/, 2 , A 3 ,...}. 

Next we want to consider decision problems connected with finite 
structures. For any structure 21, define the theory of 21, written Th2l, 
to be the set of all sentences true in 21. For a. finite structure 21, is Th2l 
decidable? Is the set of sentences having finite models decidable? 

The following observations will help here: 

1. Any finite structure 21 is isomorphic to a structure with universe 
{1,2where n is the size of 21 (i.e., n — card |2l|). The idea here 
is, where |2l| = {ai,..., «„}, simply to replace a, by i. 

For example, suppose the language has only the parameters V and 
a two-place predicate symbol E (for the “edge” relation in a directed 
graph). Consider the finite structure 03 with universe I®! consisting of 
a set of four distinct objects {a, b, c, d], and with 

= {(A,fi>, (, b,a ), (b,c), (c, c)}. 

Then 03 is isomorphic to the structure 

({1,2, 3, 4}; {(1, 2>, (2, 1>, (2, 3>, (3, 3>}). 

But there are other possibilities here; if we had focused on the mem¬ 
bers of 1 03 1 in the order b, a, d,c then we would have arrived at the 
isomorphic (but different) structure 

({1,2, 3, 4}; {(1,2), (2, 1), (1, 4), (4, 4>}). 

2. A finite structure of the sort just described can, for a finite lan¬ 
guage, be specified by a finite string of symbols. In the example, 

({1,2, 3, 4}; {(1,2), (2,1), (2, 3), (3,3)}), 

the above line completely specifies the structure, and it can be written 
down with numerals in base 10 (or your favorite base) along with punc¬ 
tuation and delimiters (e.g., parentheses). Therefore such a structure can 
be communicated to another person or to a machine. The finite string of 
symbols can be written down in a suitable input format. 

3. Given a finite structure 21 for a finite language, with universe 
{ 1 , ..., n} (and by the preceding observation, it is possible to be given 
such an object), a wff (p, and an assignment s v of numbers in this universe 
to the variables free in ip (there are only finitely many, of course), we 
can effectively decide whether or not |=<a (p[s^\. 
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For example, given 

<33 = ({1,2, 3, 4}; {(1,2), (2,1), (2, 3), (3, 3)}) 

(p = V Vi(( —I V V2 —i EVnVi ) —*■ EV\V\), 

we can organize the computation in the tree form shown in Fig. 9. We 
find that the sentence (p, which says “Anything in the range of E is 
related to itself,” is false in 03. 



Figure9. CheckingthesentenceVi>i((-'ViJ2 _, £'u2i , i) —*■ Ev\V\) in a structure 
with universe of size 4 . 

At each leaf in the tree (i.e., each minimal vertex), we have an atomic 
formula and we do a “table look-up” to see if it is satisfied. Notice that 
each quantifier triggers a search through the /(-element universe. For 
a formula <p with k quantifiers, the number of leaves in the tree will 
be bounded by a polynomial in n of degree k. If the language contains 
function symbols, then each term needs to be evaluated, using the (finite) 
function provided by the structure. 

In particular, restricting ourselves to sentences, we can effectively 
decide, given 21 as above and a sentence er, whether or not 21 is a 
model of a. (In fact here a could even be a second-order sentence, as in 
Chapter 4.) 

*Theorem 26C For a finite structure 21 in a finite language, Th 21 is 
decidable. 
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Proof 1. By observation 1, we can replace 21 by an isomorphic 
structure with universe of the form {1without changing 
which sentences are true. Then apply observation 3. H 

Proof 2. By Exercise 17(a) of Section 2.2, there is a sentence iSg 
that specifies 21 up to isomorphism. It follows that 

Th21 = {cx\8 m 

(Details: For “c” note that if a is true in 21, then it is true in all 
isomorphic copies, and hence in all models of Sy. So <5<a |= cr. The 
other direction is simpler; if |= cr then a is true in all models 
of 8<n, of which 21 is one.) Apply Corollary 25G, noting that for 
each cr, either |=a a or |=a —i cr. H 

4. Given a sentence a and a positive integer n, we can effectively 
decide whether or not a has an n-element model. That is, the binary 
relation 


{(cr, n) | ct has a model of size n } 

is decidable. 

The key idea is that there are only finitely many structures to check, 
and we can do that. The sentence cr has a model of size n if and only 
if it has a model with universe {1,..., n } by observation 1. With the 
language restricted to the parameters that occur in cr, there are only 
finitely many such structures, and we can systematically generate all of 

them. (For example, if the only parameters are V and a 2-place predicate 

2 

symbol, then there are 2' r different structures.) Applying observation 3, 
we test to see if any of these are models of a. 

5. The spectrum of a sentence a is defined to be {n \cr has a model 
of size «}. See Exercise 16 in Section 2.2. It follows from observation 4 
that the spectrum of any sentence is a decidable set of positive integers. 

*Theorem 26D For a finite language, {cr | a has a finite model) is 
effectively enumerable. 

Proof. Here is a semidecision procedure: Given a , first check to 
see if it has a model of size 1, by using observation 4. If not, try 
size 2. Keep going. H 

*Corollary 26E Assume the language is finite, and let <J> be the set 
of sentences true in every finite structure. Then its complement, 
<t>, is effectively enumerable. 

Proof. For a sentence a, 

a e O •<==>■ (-1 cr) has a finite model. 

We can apply the above semidecision procedure to (—i cr). H 
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It follows (by Theorem 17F) that <t> is decidable if and only if it 
is effectively enumerable. But this does not happen. We state without 
proof the following: 

*Trakhtenbrot's Theorem (1950) The set of sentences 
<I> = {<r | a is true in every finite structure} 
is not in general decidable or effectively enumerable. 

Thus the analogue of the enumerability theorem for finite structures 
only is false. 

Size of Models 

In the proof of the completeness theorem in Section 2.5, we started with 
a consistent set F and formed a structure 21/ E in which it was satisfied. 
How large was that structure? We claim that if our initial language 
was countable, then |2l/£| is a countable set. Thus a consistent set of 
sentences in a countable language has a countable model. 

21 /E was constructed from a preliminary structure 21. The universe 
of 21 was the set of all terms in the language obtained by adding a 
countable set of new constant symbols. But the augmented language 
was still countable, so the set of all expressions (and hence the set of all 
terms) was countable. That is, |2t| was countable. 

The universe of 21 /E consisted of equivalence classes of members 
of 21, so it, too, is a countable set. (We can map 21/ E one-to-one into 
|2t| by assigning to each equivalence class some chosen member.) The 
conclusion is that 2t/£ is a countable structure, as claimed. 

Lowenheim-Skolem Theorem (1915) (a) Let T be a satisfiable set 

of formulas in a countable language. Then F is satisfiable in some 
countable structure. 

(b) Let L be a set of sentences in a countable language. If Y. 
has any model, then it has a countable model. 

Proof. First observe that I' is consistent, by the soundness theorem. 
Then by the completeness theorem (plus the foregoing remarks) 
it can be satisfied in a countable structure. H 

(There is another, more direct proof of this theorem that will be indi¬ 
cated in Section 4.2; see especially Exercise 1 there. That proof, which 
does not use a deductive calculus, begins with an arbitrary structure 21 
in which F can be satisfied, and by various manipulations extracts from 
it a suitable countable substructure in which F is still satisfied.) 

The Lowenheim-Skolem theorem was published by Leopold 
Lowenheim in 1915 for the case where T is a singleton; Thoralf Skolem 
in 1920 extended this to a possibly infinite F. The theorem marked a new 
phase in mathematical logic. Earlier work had been done in the direction 
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of formalizing mathematics by means of formal languages and deduc¬ 
tive calculi; this work was initiated largely by Gottlob Frege in 1879. 
For example, the Principia Mcithematica (1910-1913) of Whitehead 
and Russell carried out such a formalization in great detail. But the 
modern phase began when logicians stepped back and began to prove 
results about the formal systems they had been constructing. Other early 
work in this trend was done by David Hilbert, Emil Post, Kurt Godel 
(as mentioned before), Alfred Tarski, and others. 

For a sample application of the Lowenheim-Skolem theorem, let 
A s i be your favorite set of axioms for set theory. We certainly hope 
these axioms are consistent. And so they have some model. By the 
Lowenheim-Skolem theorem, the axioms have a countable model 6. 
Of course, 6 is also a model of all the sentences logically implied by 
A s i • One of these sentences asserts (when translated back into English 
according to the intended translation) that there are uncountably many 
sets. There is no contradiction here, but the situation is sufficiently 
puzzling to be called “Skolem’s paradox.” It is true that in the structure 
& there is no point that satisfies the formal definition of being a one- 
to-one map of the natural numbers onto the universe. But this in no 
way excludes the possibility of there being (outside 6) some genuine 
function providing such a one-to-one correspondence. 

Recall that the theory of 21, written Th 21, is the set of all sentences 
true in 21. We can apply the Lowenheim-Skolem theorem (with E = 
Th 21) to prove that for any structure 21 for a countable language, there 
is a countable elementarily equivalent structure 18. If 23 is a model of 
Th 21 then 21 = 23 because 

Nsi rr o G Th2l =A (=53 0 

and 


^21 cr 1=2 1 =M~-'CT)eTh2t=l> l=® ° => ^® <*■ 

For example, the real field (R; 0, 1, +, •) is an uncountable structure for 
a finite language. Therefore there must be some countable structure (also 
a field) satisfying exactly the same sentences. (In fact Tarski showed we 
can take the field of algebraic real numbers for this.) 

Example. Consider the structure 

21= (N;0, S, <,+,-)• 

We claim that there is a countable structure 9Jio, elementarily 
equivalent to 21 (so that 9J!o and 21 satisfy exactly the same sen¬ 
tences) but not isomorphic to 21. 

Proof. We will construct 2Jlo by using the compactness theorem. 
Expand the language by adding a new constant symbol c. Let 

E = {0 < c, SO < c, SSO < c,...}. 
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We claim that E U Th9l has a model. For consider a finite 
subset. That finite subset is true in 

91* = (N; 0, S, <,+, •, k) 

(where k — c mt ) for some large k. So by the compactness theorem 
E U Th 91 has a model. 

By the Lowenheim-Skolem theorem, E UTh 91 has a countable 
model 


971 = (|£DT|; 0 OT , S®\ < OT , + OT , •®\ c® 1 ). 

Let ilJio be the restriction of 9 71 to the original language: 

OT 0 = (|9R|;0 OT ,S TO ,< OT ,+® t ,. aR ). 

Since 9Jlo is a model of Th 91, we have 9Jlo = 91: 

t=«n & =>■ rr e Th91 =>■ l=on 0 & 

¥=<n a => -i ct e Th 91 =)> |=an 0 -la =>■ ¥=<m 0 a - 

We leave it to the reader to verify that 9J!o is not isomorphic to 
91. (|9Jlol contains the “infinite” number c OT .) H 

What about uncountable 1 languages? Assume that in the proof of 
the completeness theorem we started with a set T in a language of 
cardinality X. We claim that in this case, the structure 21 / E we made 
has cardinality <a. 

21 /E was constructed from that preliminary structure 21. The universe 
of 21 was the set of all terms in the language obtained by adding X new 
constant symbols. So the augmented language still had cardinality X. 
So (by Theorem 0D) the set of all expressions (and hence the set of all 
terms) had cardinality <X. (In fact, because we had at least the X new 
constant symbols, the set of terms had cardinality exactly A.) 

The universe of 2l/£ consisted of equivalence classes of members of 
21, so card |2l/£| < card |2l|. (We can map |2l/£| one-to-one into |2t| 
by assigning to each equivalence class some chosen member, but now 
we may need the axiom of choice.) Thus, when the smoke had cleared, 
T was satisfied in a structure 21/£ of cardinality <X. 

Lowenheim-Skolem Theorem (a) Let T be a satisfiable set of for¬ 
mulas in a language of cardinality X. Then T is satisfiable in some 
structure of size <X. 

(b) Let E be a set of sentences in a language of cardinality X. 
If E has any model, then it has a model of cardinality <X. 


1 The reader who wishes to avoid uncountable cardinals is advised to skip to the subsection 
Theories. 
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The earlier version of the Lowenheim-Skolem theorem is a special 
case of this version, wherein X = No- 

Suppose that we have an uncountable structure 21 for a countable 
language. By the Lowenheim-Skolem theorem (applied to Th 21) there 
is a countable 23 that is a model of Th2l, and hence 21 = 03, as noted 
previously. 

Conversely, suppose that we start with a countable structure 18. Is 
there an uncountable 21 such that 21 = *8? If 03 is finite (and the language 
includes equality), then this is impossible. But if *8 is infinite, then there 
will be such an 21, by the following “upward and downward Lowenheim- 
Skolem theorem.” The upward part is due to Tarski, whence the “T” of 
“LST.” 


LST T heorem Let F be a set of formulas in a language of cardinality 
X, and assume that T is satisfiable in some infinite structure. Then 
for every cardinal k > X, there is a structure of cardinality k in 
which T is satisfiable. 

Proof. Let 21 be the infinite structure in which T is satisfiable. 
Expand the language by adding a set C of k new constant symbols. 
Let 


E = {ci ^ C 2 | ci, Co distinct members of C}. 

Then every finite subset of E U T is satisfiable in the structure 
21, expanded to assign distinct objects to the finitely many new 
constant symbols in the subset. (Since 21 is infinite, there is room 
to accommodate any finite number of these.) So by compactness 
E U T is satisfiable, and by the Lowenheim-Skolem theorem it 
is satisfiable in a structure 03 of cardinality <k. (The expanded 
language has cardinality X + k = k.) But any model of E clearly 
has cardinality >k. So 03 has cardinality k: restrict 03 to the 
original language, H 

Corollary 26F (a) Let E be a set of sentences in a countable lan¬ 

guage. If E has some infinite model, then E has models of every 
infinite cardinality. 

(b) Let 21 be an infinite structure for a countable language. Then 
for any infinite cardinal X, there is a structure *8 of cardinality X 
such that *8 = 21. 

Proof, (a) Take T = E, X = K 0 in the theorem. 

(b) Take E = Th 21 in part (a). H 

Consider a set E of sentences, to be thought of as nonlogical ax¬ 
ioms. (For example, E might be a set of axioms for set theory or a set 
of axioms for number theory.) Call E categorical iff any two models 
of E are isomorphic. The above corollary implies that if E has any 
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infinite models, then £ is not categorical. There is, for example, no 
set of sentences whose models are exactly the structures isomorphic 
to (N; 0, S, +, •)■ This is indicative of a limitation in the expressive¬ 
ness of first-order languages. (As will be seen in Section 4.1, there are 
categorical second-order sentences. But second-order sentences are pe¬ 
culiar objects, obtained at the cost of holding the notion of subset fixed, 
immune from interpretation by structures.) 

Theories 

We define a theory to be a set of sentences closed under logical impli¬ 
cation. That is, T is a theory iff T is a set of sentences such that for any 
sentence cr of the language, 

T |= a =>■ a e T. 

(Note that we admit only sentences, not formulas, with free variables.) 

For example, there is always a smallest theory, consisting of the 
valid sentences of the language. At the other extreme there is the theory 
consisting of all the sentences of the language; it is the only unsatisfiable 
theory. 

For a class AC of structures (for the language), define the theory of 1C 
( written Th 1C) by the equation 

Th/C = {cr | a is true in every member of JC}. 

(This concept arose previously in the special case 1C — {21}.) 

Theorem 26G Th/C is indeed a theory. 

Proof. Any member of K, is a model of Th/C. Thus if o is true 
in every model of Th/C, then it is true in every member of 1C. 
Whence it belongs to Th /C. H 

For example, if the parameters of the language are V, 0,1, +, and •, 
and T is the class of all fields, then Th T. the theory of fields, is simply 
the set of all sentences of the language which are true in all fields. If Tq 
is the class of fields of characteristic 0, then Th Tq is the theory of fields 
of characteristic 0. 

Recall that for a set £ of sentences, we defined Mod £ to be the class 
of all models of £. Th Mod £ is then the set of all sentences which are 
true in all models of £. But this is just the set of all sentences logically 
implied by £. Call this set the set of consequences of £, Cn £. Thus 

Cn£ = {or | £ 1= cr} 

= ThMod £. 

For example, set theory is the set of consequences of a certain set of 
sentences, known, unsurprisingly, as axioms for set theory. A set T of 
sentences is a theory iff T — Cn 7'. 
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A theory T is said to be complete iff for every sentence a, either 
a e T or (—i cr) e T. For example, for any one structure 21, Th{2l} 
(written, as before, “Th 21”) is always a complete theory. In fact, it is 
clear upon reflection that Th /C is a complete theory iff any two members 
of K, are elementarily equivalent. And a theory T is complete iff any 
two models of T are elementarily equivalent. 

For example, the theory of fields is not complete, since the sentences 

1 + 1 = 0 , 

3 x x • x — 1 + 1 

are true in some fields but false in others. The theory of algebraically 
closed fields of characteristic 0 is complete, but this is by no means 
obvious. (See Theorem 26J.) 

*Definition. A theory T is axiomatizable iff there is a decidable set 
E of sentences such that T — Cn E. 

Definition. A theory T is finitely axiomatizable iff T — Cn E for 
some finite set E of sentences. 

In the latter case we have T — Cnfer} (written “T — Cner”), where 
a is the conjunction of the finitely many members of E. For example, 
the theory of fields is finitely axiomatizable. For the class T of fields 
is Mod «T>, where <t> is the finite set of field axioms. And the theory of 
fields is Th Mod <I> = Cn <t>. 

The theory of fields of characteristic 0 is axiomatizable, being Cn ct> 0 , 
where «T>o consists of the (finitely many) field axioms together with the 
infinitely many sentences: 

1 + 1 / 0 , 

1 + 1 + 1 / 0 , 


This theory is not finitely axiomatizable. To prove this, first note that no 
finite subset of <E>o has the entire theory as its set of consequences. (For 
that finite subset would be true in some field of very large characteristic.) 
Then apply the following: 

Theorem 26H If Cn E is finitely axiomatizable, then there is a finite 
Eo c E such that Cn Eo = Cn E. 

Proof. Say that Cn E is finitely axiomatizable; then Cn E = Cn r 
for some one sentence r. In general r ^ E, but at least E(= r. 
(r e Cn r = Cn E.) By the compactness theorem there is a finite 
Eo c E such that Eo / r. Then 

Cn r C Cn Eo C Cn E, 


whence equality holds. 
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We can now restate Corollaries 25F and 25G in the present termi¬ 
nology: 

*Corollary 261 (a) An axiomatizable theory (in a reasonable lan¬ 

guage) is effectively enumerable. 

(b) A complete axiomatizable theory (in a reasonable lan¬ 
guage) is decidable. 

We can represent the relationships among these concepts by means 
of a diagram (in which we have included the results of Exercise 6): 

Decidable ;e Finitely axiomatizable 

Effectively enumerable < > Axiomatizable 

For example, a theory that is given in axiomatic form (such as 
Zermelo-Fraenkel set theory, which is Cn A zf for a certain set Azf) 
is effectively enumerable. We will argue in Section 3.7 that set the¬ 
ory (if consistent) is not decidable and not complete. Number theory, 
the theory of the structure (N; 0, S, E), is complete but is not 

effectively enumerable and hence not axiomatizable (Section 3.5). 

We can use part (b) of the preceding corollary to establish the de¬ 
cidability of an axiomatizable theory, provided we can show that the 
theory in question is complete. This can sometimes be done by means 
of the Los-Vaught test for completeness. 

Say that a theory T is Ko -categorical iff all the infinite countable 
models of T are isomorphic. More generally, for a cardinal k, say that T 
is k- categorical iff all models of T having cardinality k are isomorphic. 

Eos- Vaught Test (1954) Let T be a theory in a countable lan¬ 
guage. Assume that T has no finite models. 

(a) If T is Ko-categorical, then T is complete. 

(b) If T is k -categorical for some infinite cardinal k, then T is 
complete. 

Proof. It suffices to show for any two models 21 and 23 of T that 
21 = 23. Since 21 and 23 are infinite, there exist (by the LST 
theorem) structures 21' = 21 and 23' = 23 having cardinality k. 21' 
is isomorphic to 23', so we have 

21 = 21' = 23' = 23. 


Thus 21 = 23. H 

(If T is a theory in a language of cardinality X, then we must demand 
that X < k.) 
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The converse to the Los-Vaught test is false. That is, there are com¬ 
plete theories which are not k -categorical for any k. 

In Section 3.1 we will apply the Los-Vaught test to prove the decid¬ 
ability of the theory of the natural numbers with zero and successor. It 
can also be used to prove the decidability of the theory of the complex 
field. But this proof will utilize an excursion into algebra. 

Theorem 26J (a) The theory of algebraically closed fields of char¬ 

acteristic 0 is complete. 

*(b) The theory of the complex field 

e= (C; 0, !,+,-) 


is decidable. 

Proof. Let A be the class of algebraically closed fields of charac¬ 
teristic 0. Then A — Mod(<J>o U T), where <t>o consists as before 
of the axioms for fields of characteristic 0, and T consists of the 
sentences 

V a V b V c(a ^ 0 —► 3 x a • x • x + b • x + c — 0), 

V flV £> V c V d(a ^ 0 —* 3 x a • x • x • x b • x • x + c • x + d = 0 ), 

The set <t>o U T is decidable and Th A — Cn(<t>o U T), so this 
theory is axiomatizable. Part (a) of the theorem asserts that the 
theory is also complete, whence it is decidable. 

Part (b) follows from part (a). For we have £ e A, whence 
Th A c Th €. The completeness of Th A implies that equality 
holds; see Exercise 2. 

To prove part (a), we apply the Los-Vaught test. The models 
of Th A are exactly the members of A. These are all infinite. We 
further claim that Th A is categorical in any uncountable cardinal¬ 
ity. This is equivalent to saying that any two algebraically closed 
fields of characteristic 0 having the same uncountable cardinality 
are isomorphic. 

This last assertion is a known result of algebra. We will sketch 
the proof, for the interest of those readers familiar with this topic. 
Any field 5 is obtainable in the following way: (1) One begins 
with the prime subfield, which is determined to within isomor¬ 
phism by the characteristic of (2) One takes a transcendental 
extension, determined within isomorphism by the cardinality of 
the transcendence basis, i.e., by the transcendence degree of ft 
(over its prime subfield). (3) One finally takes some algebraic ex¬ 
tension. We thus have a theorem of Steinitz: Two algebraically 
closed fields are isomorphic iff they have the same characteristic 
and the same transcendence degree. 

If the transcendence degree of an infinite field 5 is k, then the 
cardinality of $ is the larger of k and K (l . Hence for an uncountable 



Chapter 2: First-Order Logic 


159 


field, the cardinality equals the transcendence degree. So we con¬ 
clude from Steinitz’s theorem that two algebraically closed fields 
having the same characteristic and the same uncountable cardi¬ 
nality are isomorphic. H 

The theory of the real field 

(R; 0,1,+,-) 

is also decidable. But this result (which is due to Tarski) is much deeper 
than the above theorem. The theory of the real field is not categorical in 
any infinite cardinality, so the Los-Vaught test cannot be applied. 

As a final application, we can show that the ordering of the rationals 
is elementarily equivalent to the ordering of the reals, 


( Q ; <q) = <«)> 


where Q and R are the rationals and reals, respectively, and <q and < r 
are the corresponding orderings. To show elementary equivalence, we 
show that both are models of some complete theory (which then must 
coincide with the theory of each structure). The key fact is provided by 
a theorem of Cantor: Any two countable dense linear orderings without 
endpoints are isomorphic. 

To give the details, we must back up a little. The language here has 
equality and the parameters V and <. Let <5 be the conjunction of the 
following sentences: 

1. Ordering axioms (trichotomy and transitivity): 

V.r V y(x <yVx = yVy<x), 

V x V y(x < y —► y yC x), 

V.r Vy Vz(x < y —* y < z —► x < z). 

2. Density: 

V x V y(x < y —► 3 z(x <z<y )). 

3. No endpoints: 


V.r 3 y 3 z(y < x < z). 

The dense linear orderings without endpoints are, by definition, the 
structures for this language that are models of S. It is clear that they are 
all infinite. Furthermore, we claim that the theory of these orderings, 
CnS, is Ko-categorical. This is provided by the following fact. 

Theorem 26K (Cantor) Any countable model of S is isomorphic to 

(Q,<e). 


We leave the proof to Exercise 4. 
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We can now apply the Los-Vaught test to conclude that Cn<5 is 
complete. Hence any two models of <5 are elementarily equivalent; in 
particular, 

( Q ; <q) = ( R ; <«). 

We can also conclude that these structures have decidable theories. 

Prenex Normal Form 


It will at times be convenient to move all the quantifier symbols to the 
left of other symbols. For example, 

V.r (Ax —* Vy Bxy) 

is equivalent to 

Vi Vy(Ax —* Bxy). 

And 

V.r (Ax —»• 3 y Bxy) 

is equivalent to 

Vi 3 y(Ax —» Bxy). 

Define a prenex formula to be one of the form (for some n > 0) 

Q\x\ ■ ■ ■ Q n x n a, 

where Q , is V or 3 and a is quantifier-free. 

Prenex Normal Form Theorem For any formula, we can find a log¬ 
ically equivalent prenex formula. 

Proof. We will make use of the following quantifier manipulation 
rules. 


Qla. -iViffl |==] 3j-ia. 

Qlb. -i 3 x a |==] Vx -i a. 

Q2a. (a — ► Vx/1) (==| V.i(a — > fi) for.i not free in a. 
Q2b. (a — >-3 x P) |==] 3 x(a —> ft) for x not free in a. 
Q3a. (V.i a — > ft) |==| 3 x(a — * ft) for x not free in ft. 
Q3b. (3ia —y P) |==] V.r(oi —* P) for x not free in p. 


Q1 is clear; for the others see the examples in Section 2.4 and 
Exercise 8 there. 

We now show by induction that every formula has an equivalent 
prenex formula. 


1. For atomic formulas this is vacuous, as any quantifier-free 
formula is trivially a prenex formula. 

2. If a is equivalent to the prenex a', then V x a is equivalent 
to the prenex Via'. 
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3. If a is equivalent to the prenex a ', then —\ a is equivalent to 
—i a'. Apply Q1 to -i a' to obtain a prenex formula; for example, 

-iVxByBzjS |==] 3 x V y Vz-i p. 

4. Finally we come to the case of a — y p. By inductive hy¬ 
pothesis we have prenex formulas a' and f>' equivalent to a and 
P, respectively. By our theorems on alphabetic variants, we may 
further assume that any variable that occurs quantified in one of 
the formulas a' and p' does not occur at all in the other. We 
then use Q2 and Q3 to obtain a prenex formula equivalent to 
a' — p' (and hence to a — y P). Observe that there is some lati¬ 
tude in the order in which the rules Q2 and Q3 are applied. For 
example, 

V x 3 y q> — y3u ip 

(where x and y do not occur in ip, u does not occur in (p) is 
equivalent to any of the following: 

3x V y 3 u(ip —y i p), 

3 x 3 mV y(ip — y ip), 

3u3xVy((p—y ip). 


Retrospectus 

At the beginning of this book it was stated that symbolic logic is a 
mathematical model of deductive thought. This is as good a time as 
any to reflect on that statement, in the light of the material treated 
thus far. 

As a first example, consider a mathematician working in set theory. 
He uses a language with an equality symbol, a symbol e for member¬ 
ship, and numerous defined symbols (0, U, and others). In principle the 
defined symbols could be eliminated and any sentence replaced by an 
equivalent sentence in which the defined symbols did not appear. (In 
this connection see Section 2.7, where this topic is treated systemati¬ 
cally.) He takes as primitive (or undefined) notions the concepts of set 
and membership. He adopts some set A$t of axioms involving these 
concepts. He asserts that for certain sentences (his theorems), these 
sentences are true provided the axioms are true, regardless of what the 
undefined notions of set and membership actually mean. In support 
of these assertions he offers proofs, which are finitely long arguments 
intended to convince his colleagues of the correctness of the assertions. 

In terms of first-order logic we can describe all this as follows: The 
language here is a first-order language with equality and a two-place 
predicate symbol e. Thus V and e are the only parameters open to 
interpretation. There is a certain set Ast of sentences in this language 
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singled out as being the set of (nonlogical) axioms. Then certain other 
sentences are logical consequences of Ast. i.e., are true in any model of 
Ast- If r is a consequence of Ast (and only then), there is a deduction 
of r from Ast- 

Next consider a more typical case of the hypothetical working math¬ 
ematician, that of the algebraist or analyst. The algebraist uses axioms 
for (say) group theory, but he also employs some amount of set theory. 
Similarly, the analyst deals with sentences that involve both numbers 
and sets of numbers. In both cases it is generally recognized that one 
could, in principle, convert the assertions of algebra and analysis to as¬ 
sertions of set theory. And then the remarks of the preceding paragraph 
again apply. 

The interest that symbolic logic holds for the mathematician is largely 
due to the accuracy with which it mirrors mathematical deductions. In 
the long run, it will surely be useful to understand the fundamental 
processes of doing mathematics. 

There remains the question of the accuracy with which first-order 
logic mirrors nonmathematical deductive thought. Logic, symbolic and 
nonsymbolic, has always formed a traditional part of the philosophical 
study of the process by which people come to hold certain ideas. Non¬ 
mathematical examples to which first-order logic applies are provided 
by a vast array of frivolous situations. Lewis Carroll gave such exam¬ 
ples, one of which inferred that babies cannot manage crocodiles from 
the three hypotheses: (1) Babies are illogical. (2) Nobody is despised 
who can manage a crocodile. (3) Illogical persons are despised. 

But what of nonfrivolous situations? Here the applicability is ob¬ 
scured by the fact that we usually do not make explicit the assump¬ 
tions we use in drawing conclusions. There are specific areas (in di¬ 
verse fields such as physics, medicine, and law) where assumptions 
not only can be made explicit but are being made explicit. In some 
cases it appears that less than the full versatility of first-order logic is 
required to formalize the real-life deductions. Possibly in other cases — 
ranging from everyday life to quantum mechanics — more features may 
be necessary. 


Exercises 

1. Show that the following sentences are finitely valid (i.e., that they 
are true in every finite structure): 

(a) 3 x 3 v 3 z\(Pxfx —* Pxx) V (Pxy A Pyz A -i Pxz)] 

(b) 3.vVy 3z[(Qzx —* Qzy) —► (Qxy —► Qxx)] 

Suggestion: Show that any model of the negation must be infinite. 

2. Let T\ and 73 be theories (in the same language) such that (i) 7) C 
72, (ii) T\ is complete, and (iii) 73 is satisfiable. Show that 7) = 73. 
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3. Establish the following facts: 

(a) Ei C E 2 Mod Eh L= Mod Ei. 

£i c K 2 =F Th K .2 c Th/Ci. 

(b) E C ThMod E and /C c Mod Th/C. 

(c) Mod E = Mod Th Mod E and Th /C = Th Mod Th/C. (Part 
(c) follows from (a) and (b).) 

4. Prove that any two countable dense linear orderings without end¬ 
points are isomorphic (Theorem 26K). Suggestions: Let 21 and 23 
be such structures with |2l| = {ao, «t, ■ ■ •} and 1*81 = { b o, b\, .. .}. 
Construct an isomorphism in stages; at stage 2 n be sure a n is paired 
with some suitable bj, and at stage 2 n + 1 be sure b„ is paired with 
some suitable a,. 

5. Find prenex formulas equivalent to the following. 

(a) (3 x Ax A 3 x Bx) —*■ Cx. 

(b) V x Ax <->3r Bx. 

*6. Prove the converse to part (a) of Corollary 261: An effectively enu¬ 
merable theory (in a reasonable language) is axiomatizable. Sug- 
gestion: The set {<7o, <8, <72, • ■.} is equivalent (in the sense of having 
the same models) to the set {cro, <to A < t \, cto A cq A <72, ...}. 

7. Consider a language with a two-place predicate symbol <, and 
let 21 = (N; <) be the structure consisting of the natural numbers 
with their usual ordering. Show that there is some 21 elementarily 
equivalent to SB such that < a has a descending chain. (That is, 
there must be ao, a\, ... in |2t| such that (a,-+i, a,) e < a for all i.) 
Suggestion: Apply the compactness theorem. 

Remark: The point of this exercise is to demonstrate that in 
this language, one can not express, "There is no descending chain.” 

8. Assume that a is true in all infinite models of a theory T. Show 
that there is a finite number k such that o is true in all models 21 of 
T for which |2l| has k or more elements. 

9. Say that a set E of sentences has the finite model property iff each 
member <7 of E, if it has any models at all, has a finite model. 
Assume that E is a set of sentences in a finite language (i.e., a 
language with finitely many parameters) and that E has the finite 
model property. Give an effective procedure that, given any member 
<7 of E, will decide whether or not a has any models. Suggestion: Is 
the set of such sentences effectively enumerable? Is its complement 
effectively enumerable? 

10. Assume we have a finite language without function symbols. 

(a) Show that the set of satishable 3i sentences is decidable. (See 
Exercise 19 in Section 2.2 for terminology and background.) 
Suggestion: Apply the preceding exercise. 
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(b) Show that the set of valid V 2 sentences is decidable. (A V 2 
formula is one of the form Vxi • • • Vx„, 3 y\ ■ ■ ■ 3 y n 9 where 
6 is quantifier-free.) 

Remarks: In first-order logic, the “decision problem” (Entschei- 
dungsproblem) is the problem of deciding, given a formula, whether 
or not it is valid. By Church’s theorem (Section 3.5), this problem 
in general is unsolvable. This exercise gives a solvable subcase of 
the decision problem. 


SECTION 2.7 _ 

Interpretations Between Theories 1 

In some cases a theory T\ can be shown to be every bit as powerful as 
another theory Tq . This is certainly the case if the theories are in the 
same language and Tq C 7). But even if the theories are in different 
languages, there may exist a way of translating from one language to 
the other in such a way that members of 7o are translated as members 
of T\. This sort of situation will be examined in this section. 

We will begin by discussing the topic of defined symbols. This topic, 
as well as having significant interest of its own, will serve as an example 
for the situation of the preceding paragraph, wherein Tq is constructed 
from 7j by adding a new defined symbol. If the definition is done prop¬ 
erly, the original theory 7) should in principle be just as strong as the 
new Tq. We will consider only the case of defined function symbols, 
since the case of defined predicate symbols presents, in comparison, no 
real difficulties. 

Defining Functions 

Frequently in mathematics it is useful to introduce definitions of new 
functions. For example, in set theory one defines the power-set operation 
V by a sentence like, “Fet Vx be the set whose members are the subsets 
of x.” Or by a sentence in a formal language (here containing e, C, 
and V), 


V vi V V 2 [Vv\ — V 2 ■*—► V u{u e V 2 •*-» u c ui)]. 

Now definitions are unlike theorems and unlike axioms. Unlike the¬ 
orems, definitions are not things we prove. We just declare them by 
fiat. But unlike axioms, we do not expect definitions to add substantive 
information. A definition is expected to add to our convenience, not to 
our knowledge. 


1 The results of this section will be used only in the latter part of Section 3.7. 
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If this expectation is to be realized, the definition must be made in 
a reasonable way. As an example of a most unreasonable definition in 
number theory, suppose that we introduce a new function symbol by the 
“definition” 


f(x) = y iff x < y. 

(Or by the sentence in a formal language: V V\ V v 2 ifv\ — v 2 <—* fi < 
if?)-) Since we know that 1 < 2, we see that /(1) = 2. But also 1 < 3, 
so we obtain /(1) = 3. And so we come to the conclusion (which does 
not itself involve /) that 2 = 3. 

Obviously this definition of / was in some way very bad. It did not 
just make matters convenient for us; it enabled us to conclude that 2 = 3, 
which we could not have done without the definition. The trouble was 
that the definition bestowed the name “/(1)” ambiguously upon many 
things (2 and 3 among them). Thus /(1) was not “well defined.” Names 
ought to designate unique objects. 

In this subsection we want to consider conditions under which we can 
be assured that a definition will be satisfactory. To simplify the notation, 
we will consider only the definition of a one-place function symbol /, 
but the remarks will apply to n -place function symbols as well. 

Consider then a theory T in a language not yet containing the one- 
place function symbol /. (For example, T might be the set of conse¬ 
quences of your favorite axioms for set theory.) We want to add / to 
the language, introducing it by the definition 

Vvi Vv 2 [fv\ = V 2 •*->■ <p], (S) 

where ip is a formula in the original language (i.e., a formula not con¬ 
taining /) in which only V\ and v 2 may occur free. 

Theorem 27A In the above situation, the following are equivalent: 

(a) (The definition is noncreative.) For any sentence a in the 
smaller language, if 

T;8\=a 

(in the augmented language), then already T |= a. 

(b) i f is well defined.) The sentence 

\/v\3\v 2 (p is) 

is in the theory T. (Here “3 !i>2 (fi” is an abbreviation for a longer 
formula; see Exercise 21 of Section 2.2.) 

Proof. To obtain (a) =>■ (b), simply note that 8 |= s. So by taking 
a = s in part (a), we obtain T |= s. 

Conversely, assume that T |= e. Let 21 be a model of T. 
(21 is a structure for the original language.) For d e |2l|, let Fid) 
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be the unique e e |2l| such that |=<j( (p^cl, ej. (There is a unique 
such e because |=a e.) Let (51, F) be the structure for the aug¬ 
mented language that agrees with 21 on the original parameters 
and that assigns F to the symbol /. Then it is easy to see that 
(21, F) is a model of <5. Furthermore, 21 and (21, F) satisfy the 
same sentences of the original language. In particular (21, F ) is a 
model of T. Hence 

T\ S \= 0 =>• |=(21,F) 0 

=> 1=21 er. _| 


(This argument can be stated more briefly by using second-order 
logic, s is logically equivalent to the sentence 3 f S.) 

Interpretations 

The basic idea is that it is possible for one theory to be just as strong (in 
a sense to be made precise) as another theory in another language. In 
considering two languages simultaneously, there is no point in allowing 
them to conflict; e.g., the negation symbol of one language should not 
be a predicate symbol in the other. We can eliminate such conflicts by 
assuming that each of the languages is obtained from a third parent 
language by deleting some parameters (and perhaps equality). 

For example, axiomatic set theory is at least as strong as the theory of 
the natural numbers with zero and successor, i.e., the theory of (N; 0, .S'). 
Any sentence in the language of (N; 0, S ) can be translated in a natural 
way into a sentence of set theory. (This translation is sketched briefly 
in Section 3.7.) If the original sentence was true in (N; 0, S), then the 
translation will be a consequence of the axioms of set theory. (This is 
not obvious; the proof uses facts to be developed in Section 3.1.) 

Let us look more carefully at a second example. Consider on the one 
hand the theory of 


(N; 0, S ) 

in its language, and on the other hand the theory of 


(Z; +, •) 


in its language. (Here Z is the set of all integers, positive, negative, and 
zero.) We will shortly be in a position to claim that the second theory is 
as strong as the first. How might a sentence about the natural numbers 
N with 0 and S be translated into a sentence about the integers Z with 
addition and multiplication? 

The first clue is provided by Lagrange’s theorem from number the¬ 
ory: an integer is nonnegative iff it is the sum of four squares. Thus a 
quantifier V x in the first language (where x is intended to range over N) 



Chapter 2: First-Order Logic 


167 


can be replaced by 

V x( 3 yi 3 y 2 3 y 3 3y 4 x = yi • yi + y 2 • y 2 + 73 • 73 + y 4 • 74—» 

in the second language. 

The second clue is that {0} and the successor function (viewed as a 
relation) are definable in (Z; +, •)• The set {0} is defined by 

Vi + Vi = V\. 

The successor relation (extended to Z) is defined by 

Vz(z • z = z A z + : ^ + : = v 2 ). 

Thus the sentence about (N; 0, S) 

VxSx / <) 

can be translated into 

V x[3 y\ 3 y 2 3 y 3 3 y 4 x — y i • yi + y 2 • 72 + 73 • 73 + 74 • 74^ 
-iV«(« + h = m-»Vii(Vz(z'Z = zAz-Fz^z->.v + z 
= V) —>v = u))]. 

So much for examples. For our general discussion it will be helpful 
to introduce the notation 

(Pit) = <P?\ 

<Pih,t 2 )= {<Ph)2’ 

and so forth. Thus tp = tp(V]) — q)(v 1 , v 2 ). If we use “<p(x)” we will 
not worry too much about whether or not x is substitutable for V\ in tp. 
If it is not, then we actually want <p(x) to be 1 /r^ 1 2 , where i/r is a suitable 
alphabetic variant of ip. 

Assume now that we have the following general situation: 

Lq is a language. (A language can for all practical purposes be a set 
of parameters, possibly augmented by the equality symbol.) 

T\ is a theory in a (possibly different) language L\, which includes 
equality. 

Definition. An interpretation n of Lq into 7) is a function on the 
set of parameters of Lq such that 

1. 7 x assigns to V a formula zrv of L \ in which at most V\ occurs 
free, such that 

(i) Ti \= 3 viity. 

(The idea is that in any model of 7), the formula TTy should define 
a nonempty set to be used as the universe of an Lo-structurc.) 

2. 7 x assigns to each n-place predicate parameter P a formula 
Up of L\ in which at most the variables V\,... v n occur free. 
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3. tc assigns to each n -place function symbol / a formula jXf 
of L i, in which at most v\,... ,v n , v n+ \ occur free, such that 

T\ |= Vui • ■•V^ 1 (lv(»l) —►-* TX\j(v n )-+ 

' 11 3x(jxy(x) a V v n+ i(7Tf(v u ..., v n+ i) <-y v n+ i = x))). 

(In English, this formula becomes, “For all v in the set defined by 
7Ty, there is a unique x such that nf(v, x), and furthermore x is 
in the set defined by 7Tv-” The idea is to ensure that in any model 
of T\, 7Tf defines a function on the universe defined by jty. In the 
case of a constant symbol c, we have n = 0 and (ii) becomes 

T\ |= 3 x(ttv(x) A V V\(tt c (vi) Vx = x)). 

In other words, n c defines a singleton whose one member is also 
in the set defined by Tty.) 

For example, if Lq is the language of (N; 0, S ) and T\ is the theory 
of (Z; +, •), then we have 

ny(x) — 3 yi 3 y 2 3 y 3 3 y 4 x — y 3 • y i + y 2 • y 2 + y 3 • J 3 + y 4 • y 4 , 

7To(x) —X + X = X, 

xs(x, y) = V z (z • z = z A z + z # z —* x + z — y)- 

(We are here exploiting the fact that in (Z; +, •) we can, in effect, define 
the structure (N; 0, S).) 

If Lq coincides with L \ , there is trivially the identity interpretation 
n , for which 

7ty = v\ = v \, 

Tip = Pv 1 • • • V n , 

IXf — f V] ' ' ' V n — V n+ 1. 

The conditions (i) and (ii) are then met no matter what T\ is. 

Now assume that tc is an interpretation and let 03 be a model of T\. 
There is a natural way to extract from 03 a structure 71 03 for Lo- Namely, 
let 

| ,r 03| = the set defined in 03 by Tty, 

P n * — the relation defined in 18 by 7 Tp, restricted to 1^031, 
/ ®(«i,..., a„) — the unique b such that tx f\a 3 ,..., a n , £>]], 

where a\, ,.., a„ are in 1*031. 

By condition (i) in the definition of interpretations, |*03| ^ 0. And, by 
condition (ii). the definition of /** makes sense; i.e., there is a unique 
b meeting the above condition. Hence *18 is indeed a structure for the 
language Lq. 

Define the set rx 1 [7) ] of /lo-sentences by the equation 

7r _1 [T 1 ] = Th{*18 | 03 e Mod Ti} 

= {a | cr is an Lo-sentence true in every structure *03 
obtainable from a model 03 of 7) |. 
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This is a theory, as is Th/C for any class AC. It is a satisfiable theory iff 
7j is satisfiable. 

Example. Earlier in this section we had a theory T containing the 
sentence 


Vw]3!t>2 <p. (e) 

We augmented the language to a larger language L + that contained 
a function symbol /. The “definition” of / was provided by the 
L + -sentence 


V V\ V V 2 (fvi — Vo <->■ <p). (S) 

We showed that for a sentence a in the original language of T , if 
T; 8 \= cr, then T \= cr. 

We have an interpretation jt from L + into 7 ’. n is the identity 
interpretation on all parameters except /. The formula rtf is (p. The 
fact that T |= s is just what we need to verify that n is indeed an 
interpretation. For any model 21 of T, 71 21 is a structure previously 
called ( 21 , F); it is a model of T; S. 

We claim that 

7r _1 [r] = Cn(T ; A). 

First observe that any model *B of T\ S equals 71 03, where 21 is the 
restriction of 03 to the language of T . Hence for an L + -sentence a, 

a e Jt~ l [T] <£> |=* 2 i a for every model 21 of T 

O |=!g a for every model 03 of T; A 

<*• T\ S |= a. 

Syntactical Translation 

In the preceding subsection on interpretations we talked about arbitrary 
models and such. But the reader may already have noticed that there is 
a much more down-to-earth thing to be said about an interpretation it 
of Lq into 7j. Briefly: We can, given a formula (p of Lq, find a formula 
( p 71 in L\ which in some sense corresponds exactly to <p. We define (p 71 
by recursion on <p. 

First, consider an atomic formula a of Lq. For example, if a is 

Pfgx, 

then a is logically equivalent to 

V y(gx — y —► V z(fy — z^ Pz )). 

And we can take for a n the L \ -formula 


\/y(jt g {x,y) —> V z(n f (y, z) — >7Vp(z))). 
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In general, scan an atomic formula a from right to left. The rightmost 
place at which a function symbol occurs will initiate a segment of the 
form gx i • • • x„, for some n -place g. (In the example n = 1.) Replace this 
by some new variable y, and prefix V y{rc g (x \, ..., x n , >’))—. Continue 
to the next place at which a function symbol occurs. Finally, replace the 
predicate symbol P (if a parameter) by jt P (with the correct variables). 

The definition of a 77 can be stated more carefully by using recursion 
on the number of places at which function symbols occur in a. If that 
number is zero, then a is Px\ ■ ■ ■ x n and a 77 is 7tp(xi,..., x„). Other¬ 
wise, take the rightmost place at which a function symbol g occurs. If 
g is an n-place symbol, then that place initiates a segment gx\ ■ ■ ■ x n . 
Replace this segment by some new variable y, obtaining a formula we 
can call a^ Xv " Xn . Then a 77 is 

Vy(jr s (xi,..., x n , y) —> (a** 1 •"*■)*). 

For example, 

(PfgxY = Vy(iz g {x, y) — ► ( Pfyf ) 

= Vy(n g (x,y) z(n f (y,z) —> (. PzY )) 

= Vy(jr g (x, y ) -^Vz(Jt f (y, z) 7tp(z))). 

The interpretation of a nonatomic formula is defined in the obvi¬ 
ous way. (-i (p) n is (—i cp n ), (cp —» \jfY is ((fi 71 —■► ir 71 ), and (V.r ip) n is 
Vx(j rv(x) —> < p 7I ). (Thus the quantifiers are “relativized” to 7T\/.) 

The sense in which (p 77 “says the same thing” as <p is made precise in 
the following basic lemma. 

Lemma 27B Let n be an interpretation of Lq into 7), and let *B be 
a model of T\. For any formula <p of Lq and any map v of the 
variables into \ n< B\, 

l=-®<o[«] iff 

This is not a deep fact. It just says that cp 71 was defined correctly. 

Proof. We use induction on <p, but only the case of an atomic 
formula a is nontrivial. For a , we use induction on the number of 
places at which function symbols occur. It is easy if that number 
is zero. Otherwise, 

a 77 =Vy(n g (x,y) —> fi 77 ), 

where fi y gx — a. (We have quietly assumed that g is a one-place 
symbol; the notation is bad enough already.) Let 

b = the unique b such that f=<g 7r g [[.s(.r), 

= /*($(*)). 



Chapter 2: First-Order Logic 


171 


Then 

Nb a"[s]4H=<8 r[s(y\b)] 

fi[s(y | b )] by the inductive hypothesis 
N"8 f5 y gx [s] by the substitution lemma 
44 |=*23 «[■*]• 


The following corollary justifies our choice of notation for tt 1 [7j ]. 

Corollary 27C For a sentence er of Lq, 

a e Tr _1 [7’ 1 ] iff er 71 e 7j . 

Proof. Recall that by definition 

a e 7T _1 [7j] 44 for every model 03 of 7j, ^rsg a 

44 for every model 03 of 7j, |=<b a 71 by Lemma 27B 

Definition. An interpretation n of a theory To into a theory 7\ is 
an interpretation n of the language of Tq into 7j such that 

roc^m]. 

In other words, it is necessary that for an Lo-sentence er, 
er € 7o =S e 7j. 

7T —1 [7i] is the largest theory that tt interprets into 7j. If Tq — 7T —1 [7"i], 
then we have 


a e T 0 <4 a 71 e 7j. 

In this case tt is said to be a faithful interpretation of Tq into 7j. 

To return to an earlier example, consider the structures (N; 0, .S' ) and 
(Z; +, •). We had an interpretation tt into Th(Z; +, •), where 

ttv(x) = 3 yi 3 y 2 3 y 3 3 y 4 x = yi • yi + y 2 • yi + V 3 • V 3 + >’4 • 74, 

TT 0 (x) = X + X — X, 

tts(x, y) = Vz(z • z = z A z + z # z —»■ x + z = y). 

We now claim that tt is a faithful interpretation of Th(N; 0, S ) into 
Th(Z; +, •)• For in this case , 77 (Z; +, •) is the structure (N; 0, S). Hence 

t=(N;0,S) a S=A 1= 7T(Z ; +,-) a S=S \= (Z;+,.)<? 77 . 

In Chapter 3 we will be able to show that there is no interpretation of 
Th(Z; +, •) into Th(N; 0, S). Thus the former theory is strictly stronger 
than the latter. 



172 


A Mathematical Introduction to Logic 


Finally, let us return to the situation with which we started this section. 
Assume that T is a theory containing the sentence s, where 

s — V V\ 3lv2(p\ 

S — \/ v i V V2(fv i = *->■ (p)\ 

L + — the language obtained by adding the new function symbol / 
to the language of T ; 

re — the interpretation of L + into T that is the identity interpretation 
on all parameters except /, and jtf = ip. 

In fact, 7r is a faithful interpretation of Cn (T ; 8) into T, since, as 
noted previously, 

= Cn(T; 5). 

We can now draw an additional conclusion; the definition is eliminable. 

Theorem 27D Assume that we have the situation described above. 
Then for any L + -sentence cr we can find the sentence a 71 in the 
original language such that 

(a) T; 8 \= (cr <-► a 71 ). 

(b) T; 8 |= or T |= a 71 . 

(c) If / does not occur in a, then |= (a a 71 ). 

Proof. Part (c) follows from the fact that jt is the identity inter¬ 
pretation on all parameters except /. Part (b) restates that n is a 
faithful interpretation of Cn(T; 5) into T. Since tc is faithful, for 
(a) it suffices to show that 

T 1= (cr «— ct 7T ') 7T . 

This follows from (c), since (a <—* a 71 ) 71 is ( a 77 <—> a 7171 ), which is 
valid. H 


Exercises 

1. Assume that Lq and L\ are languages with the same parameters 
except that Lq has an n -place function symbol f not in L\ and L\ 
has an (n + l)-place predicate symbol P not in Lq. Show that for 
any Lo-theory T there is a faithful interpretation of T into some 
L i -theory. 

2. Let Lq be the language with equality and the two-place function 
symbols + and •. Let L \ be the same, but with three-place predicate 
symbols for addition and multiplication. Let Tt,- = (N; +, •) be the 
structure for L, consisting of the natural numbers with addition and 
multiplication (i — 0, 1). Show that any relation definable by an 
Lo-formula in Tlo is also definable by an L | -formula in 9Ti. 
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3. Show that an interpretation of a complete theory into a satisfiable 
theory is faithful. 


SECTION 2.8 _ 

Nonstandard Analysis 1 

The differential and integral calculus was originally described by 
Leibniz and Newton in the seventeenth century in terms of quantities 
that were infinitely small yet nonzero. Newton used in his calculations 
a number o that, being infinitely small, could be multiplied by any finite 
number and still be negligible. But it was necessary to divide by o, so it 
had to be nonzero. Leibniz’s dx was less than any assignable quantity, 
yet was nonzero. 

These ideas were not easy to comprehend or to accept. Throughout 
the eighteenth century this business of working with infinitesimals was 
attacked (e.g., by Bishop Berkeley), distrusted (e.g., by D’Alembert), 
and used in enthusiastic experimentation (e.g., by Euler). While Euler 
was creating the mathematics that students now study in advanced cal¬ 
culus, he used infinitesimals in a loose, free-swinging manner that would 
not be tolerated in today’s freshmen. Only in the nineteenth century were 
the foundations of calculus presented in the form now found in text¬ 
books. The treatment of limits was then rigorous, and debate subsided. 

In 1961 Abraham Robinson introduced a new method for treating 
limits, rescuing infinitesimals from their intellectual disrepute. This 
method combines the intuitive advantages of working with infinitely 
small quantities with modern standards of rigor. The basic idea is to 
utilize a nonstandard model of the theory of the real numbers. 

Construction of *93 

We will use a very large first-order language. In addition to symbols 
for +, •, and < we might as well add symbols for the exponentiation 
and absolute-value functions. And since there is no good reason to stop 
there, we go all the way and include a symbol for every operation on the 
set R of reals. We do the same for every relation on R. Thus we have 
the language with equality and the following parameters: 

0. V, intended to mean “for all real numbers.” 

1. An n-place predicate symbol Pr for each n-ary relation R on R. 

2. A constant symbol c r for each r e R. 

3. An n-place function symbol f F for each n-ary operation F on R. 


1 This section may be omitted without loss of continuity. 
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For this language there is the standard structure IK, with |91| = R, 
P^ — R, cf — r, and f'f — F. But now let us form a nonstandard 
structure, by using the compactness theorem. Let T be the set 

Th91 U {c,P < V\ | r e R}. 

(Here c, P < v\ formalizes “r is less than V \.”) Any finite subset of T can 
be satisfied in 91 by assigning to V\ some large real number. Hence by 
the compactness theorem there is a structure 21 and an element a e |2l| 
such that T is satisfied in 21 when V\ is assigned a. Since 21 is a model 
of Th 91, we have 21 = 91. There is also an isomorphism h of 91 into 
(but not onto) 21 defined by 

h(r) = cf. 

To check that this is indeed an isomorphism, we use the fact that 21 = 91. 
h is one-to-one, since for r\ ^ n, the sentence c n cy 2 holds in 91 and 
hence in 21. h preserves a binary relation R(— P F ) since for any r and 
s in R, 

{r , s) e P’s Nm Prc, c s 
O 1=21 PrCiCs 
O (c?,c?)eP» 

&(h(r),h(s)) ePl 

A similar calculation applies to any n-ary relation. Next we must show 
that h preserves any function F(— /^). Again for notational ease, sup¬ 
pose that F is a binary operation. Consider any r and s in R, and let 
t — F(r , s). Then 

h(f*(r,s))=h(F(r , S )) 



Now the sentence c, — f F c, c s holds in 91 and hence in 21. Hence 

c?=f*(cf,c?) 

=ft(h(r),h(s)). 

So h preserves f F . For the constant symbols we have by the definition 
of h, 

h(c^) —h(r) 

—/- 21 
— c r . 

Since we have an isomorphic copy of 91 inside 21, we can find an¬ 
other structure *91 isomorphic to 21 such that 91 is a substructure of *91. 
The idea is simply to replace in 21 the point cf by the point r (provided 
that |2l| fl R = 0, as can always be arranged). For details, see Exer¬ 
cise 24 of Section 2.2. Since *91 is isomorphic to 21, there is a point 
b e |*91| such that *91 satisfies T when V\ is assigned b. In particular, 
*91 = 91. 
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To go much further, we need a less cumbersome notation. We will 
use an asterisk to indicate passage from 95 to *95. 

1. For each n -ary relation R on R, let * R be the relationassigned 
to the symbol P R by *95. In particular, R is a unary relation on R. Its 
image *R equals the universe of *95, since the sentence V xPgx is true 
in 95 and hence in *95. Since 95 is a substructure of *95, we have that 
each relation R equals the restriction of *R to R. 

2. For each n- ary operation F on R, let *F be the operation / A ?' 
assigned to the symbol/^ by *95. F is then the restriction to R of * F. 

Observe that c r m — r, so we need no special notation for this. 

There is a general method (to be used heavily in the remainder of this 
section) for demonstrating properties of a relation * R or an operation 
*F. One simply observes (1) that R or F has the property, (2) that the 
property can be expressed by a sentence of the language, and (3) that 
95 = *95. 

For example, the binary relation *< on *R is transitive. This is because 
< is transitive, and this property can be expressed by the sentence 

Vx Vy Vz(xP<y —* yP < z —* xP < z). 

By similar reasoning, *< satisfies trichotomy on *R and thus is an 
ordering relation on *R. 

For another example, we can prove that the binary operation *+ on 
*R is commutative, since + is commutative and the commutative law 
can be expressed by a sentence. By applying this reasoning to each of 
the field axioms, we see that (*R; 0, 1, *+, *•) is a field. 

This general method is used so much, we will shortly begin to take 
it for granted. If, for example, we assert that *|a *+ b\ *< *|a| *+ *|Z?| 
for a and b in *R, we will take for granted that the reader perceives that 
the general method yields this fact. 

We have R C *R, but R ^ *R. For we have some point b such that 
[=» 9 i £,./*<i>i [[/?]]; i.e., r *< b. Thus b is infinitely large, being larger 
(in the ordering *<) than any standard r, i.e., any r e R. Its reciprocal 
1 */ b will be a sample infinitesimal. 

Properties of 95 that cannot be expressed in the language are likely 
to fail in *95. The least-upper-bound property is one such. There are 
non-empty bounded subsets S of *R that have no least upper bound 
(with respect to the ordering * <). For example, R is such a subset of 
*R. It is bounded by the infinite b of the preceding paragraph. But it has 
no least upper bound; see Exercise 7. 

Define the set T of finite elements by the equation 


F={re *R | *|x| *< y for some y e R}. 
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Similarly, define the set X of infinitesimals by the equation 

1 = {x e *R | * |x | *< y for all positive y e R}. 

If A C R is unbounded, then * A contains infinite points. For the 
sentence "for any real r there is an element a e A larger than r” is true 
and formalizable. Take some infinite positive b\ there must be a larger 
(and hence infinite) member of * A. For example, *N contains infinite 
numbers. 

The only standard infinitesimal, i.e., the only member of R fl X, is 0. 
But there are other infinitesimals. For by the usual (formalizable) rules 
for inequalities, the reciprocal of any infinite number is an infinitesimal. 

Algebraic Properties 

In the next theorem we collect some algebraic facts about T and X that 
will be useful later. 

Theorem 28A (a) T is closed under addition *+, subtraction *— , 

and multiplication *•. 

(b) X is closed under addition *+, subtraction *—, and multi¬ 
plication from T\ 

.tel and ze(F=^x*-zeX. 

In algebraic terminology, part (a) says that T is a subring of the field 
*R, and part (b) says that X is an ideal in the ring T. We will see a little 
later what the quotient ring T jX is. 

Proof, (a) Let x and y be finite, so that *|x| *< a, *|y| *< b for 
standard a and b in R. Then 

*\x*±y\*<*\x\*+ m \y\*< a + b eR, 
whence x *+ y, x *— y are finite. Also 
*\x*- y| *< a ■ b e R, 
whence x *• y is finite, too. 

(b) Let x and y be infinitesimals. Then for any positive standard 
a, *\x\ *< a/2 and *|y| *< a/2. Hence 

*|x *± y\ *< a/2 + a/2 = a, 

so that x *+ y and x *— y are infinitesimal. If z is finite, then 
*|z| *< b for some standard b. Since x is infinitesimal, we have 
*\x\ *< a/b, whence 

*|JC *- zl *< (ci/b)b = a. 


Thus x *■ z is also infinitesimal. 
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Definition, x is infinitely close to y (written x — y) iff x *— y is 
infinitesimal. 

Theorem 28B (a) ~ is an equivalence relation on *R. 

(b) If u — v and x ~ y, then u *+ x ~ v *+ y and *— u — 
*— v. 

(c) If u — v and x ~ y and x,y,u, v are finite, then u *• x ~ 
v *■ y. 

Proof. This is a consequence of part (b) of the preceding theorem 
(1 is an ideal in T). 

(a) ~ is reflexive since 0 is infinitesimal. ~ is symmetric since 
the negative (*—) of an infinitesimal is infinitesimal. Finally, sup¬ 
pose that x — y and y ~ z- Then 

x *- z = (x*-y) *+ ( y*-z)el 

since X is closed under addition. 

(b) If u — v and x ~ y, then 

(u *+ x) *- (v *+ y) = (u *- v ) *+ (x *-y)eX 

since X is closed under addition. Also *— u — *— v since X is 
closed under negation. 

(c) (u *• x) *- (v *■ y) = (u *■ x) *- (u *■ y) *+ (u *• y) *- (v *■ y) 
= u *■ fix *- y) *+ (u *— v) *■ y e X 

since X is closed under multiplication from T. H 

For standard r and s, we have r ~ .v iff r — .y, as 0 is the only 
standard infinitesimal. 

Lemma 28C If x qt y and at least one is finite, then there is a 
standard q strictly between x and y. 

Proof. We may suppose that x *< y. In fact, we may further 
suppose that 0 *< x *< y; the case x *< y *< 0 is similar and the 
case x *< 0 *< y is trivial. Since x y there is a standard b such 
that 0 < b *< y* — x. Since x is finite we have x *< mb for some 
positive integer m; take the least such m. Then x *< mb *< y. 
(By the leastness of m, (m — 1 )b *< x. So mb *< x *+ b *< y.) 

' H 


Theorem 28D Every x e T is infinitely close to a unique r e R. 
Proof. For x e T the set 

S = {y e M | y *< x] 

of standard points below x has an upper bound in R. Let r be its 
least upper bound; we claim that x — r. 
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If x qt r, then by the lemma there is a standard q between x 
and r. If r < q *< x, then r fails to be an upper bound for S. If 
x *< q < r, then q is also an upper bound for S, contradicting 
the leastness of r. Hence x — r. 

This establishes the existence of r. As for the uniqueness, note 
that if x — r and x — s, then r ~ .v. For standard r and s this 
implies that r = s. H 

Corollary 28E Each finite x has a unique decomposition of the 
form x = s*+ i, where s is standard and i is infinitesimal. 

We call s the standard part of x, written st(x). (Another notation for 
the standard part of x is °x.) Of course for standard r, st(r) = r. The 
next theorem summarizes some properties of the st function. 

Theorem 28F (a) st maps T onto R. 

(b) st(x) = 0 iff x is infinitesimal. 

(c) st(x *+ y) — st(x) + st(v). 

(d) st(x *• y) — st(x) • st(y). 

Proof, (a) and (b) are clear. Since st(x) ~ x and st(y) ~ y, we 
have by part (b) of Theorem 28B that st(x) + st(y) ~ x *+ y. 
Hence the left side equals st(x *+ y). Part (d) is similar and uses 
part (c) of Theorem 28B. H 

(In algebraic terminology, this theorem asserts that st is a homomor¬ 
phism of the ring T onto the field E, with kernel X. Consequently, the 
quotient ring T /X is isomorphic to the real field R.) 

Henceforth in this section we will streamline our notation by omitting 
the asterisks on the symbols for the arithmetic operations *+, , *•, 

and */. 

Convergence 

In calculus courses convergence is usually treated in terms of e’s and S ’s 
and variables that come delicately close to certain values. We will give 
here the beginning of an alternative treatment of convergence, where 
variables come infinitely close to the limiting values. 

Definition. Let F : R —> R. Then F converges at a to b iff when¬ 
ever x is infinitely close to (but different from) a , then * F(x) is 
infinitely close to b. 

Proof of Equivalence with the Ordinary Definition. First suppose 
that F converges at a to h in the ordinary sense. That is, for any 
e > 0 there is a 8 > 0 such that 

0 / |x — a\ < S =>■ |b — F(x)\ < e for any x. 

The displayed sentence (concerning the standard numbers e and 
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<5) is formalizable and thus holds in *9t. Now if x in *R is infinitely 
close to (but different from) a, then certainly 0 ^ *|x — a\ *< S. 
Hence *\b — *F(x)| *< e. Since e was arbitrary, b ~ *F{x). 

Conversely, assume that the condition stated in the definition 
is met. Then for any standard s > 0, the sentence 

There exists 8 > 0 such that for allx, 0^ \a — x| < S =£> \b— F(x)\ < s 

(when formalized) holds in *91, since we can take <5 to be infinites¬ 
imal. Hence the sentence holds in 91 also. H 

First remark: It is entirely possible that F does not converge at a to 
any number. On the other hand, F converges at a to at most one b. For 
if i is a nonzero infinitesimal, then b = st(* F(a + i)). It is traditional 
to denote this b by “lim v ^ fl F(x).” Thus 

lim F(x) — st (*F(a + i)). 

x—>a 

Second remark: It is not really necessary to have dom F = M. It is 
enough for a to be an accumulation point of dom F. (a is an accumu¬ 
lation point of S iff a is infinitely close to, but different from, some 
member of *S.) 

Corollary 28G F is continuous at a iff whenever x ~ a, then 
*F(x) ~ F(a). 

Now consider a function F : R -> R and a standard a e I. Then 
the derivative F'(a) is 

F(a + h) — F(a) 

lim- 

h 

By our definition of limit, this can also be stated: F'(a) — b iff for every 
nonzero infinitesimal dx we have dF/dx — b, where dF — *F(a + 
dx) — F(a). Thus if there is such a b (i.e., if F'(a) exists), then 

F\a) = st(< dF/dx) 

for any nonzero infinitesimal dx. Here dF/dx is the result of dividing 
dF by dx. The fact that we simply use division here greatly facilitates 
calculations. 

Example. Let F(x) — x 2 . Then F'(a) = 2a, since 

dF (a + dx) 2 — a 2 2a(dx) + (dx) 2 

— = -= - — 2a + dx — 2a. 

dx dx dx 

Theorem 28H If F’{a ) exists, then F is continuous at a. 

Proof. For any nonzero infinitesimal dx we have 

*F{a + dx) — F(a) 


dx 


F'(a). 



180 


A Mathematical Introduction to Logic 


The right side is standard, so the left side is at least finite. Con¬ 
sequently, when we multiply the left side by the infinitesimal 
dx we are left with the fact that *F(a + dx) — F(a ) e X; i.e., 
*F(a + dx) — F(a). H 


The reader should note that this result is not some nonstandard ana¬ 
logue of a classical theorem, nor even a generalization of a classical 
theorem. It is a classical theorem. It is only the proof that is nonstan¬ 
dard. The same remarks apply to the next theorem. Let FoGbe the 
function whose value at a is F(G(a )). 


Chain Rule Assume that G'(a) and F'(G(a)) exist. Then (F o 
G)'(a) exists and equals F'(G(a)) ■ G'(a). 


Proof. First, notice that *(F o G) —* F o *G, since the sentence 
V V\f FoG V\ = f F f c v i holds in the structures. Now consider any 
nonzero infinitesimal dx. Let 

dG — *G(a + dx) — G(a ), 

dF — *(F o G)(a + dx) — ( F o G)(a) 

= *F(*G(a + dx))~ F(G(a)) 

= *F(G(a)+dG) - F(G(a)). 


Then dG ~ 0 since G is continuous at a. If dG ^ 0, then by the 
last of these equations dF/dG — F'(G(a)), whence 


dF 

dx 


dF dG 
dG dx 


F'(G(a)) • G'(a). 


If dG = 0, then dF = 0 and G'(a) ~ dG/dx — 0, so we again 
have 

dF 

— ~ F'(G(a)) • G (a). 

dx _i 


These theorems are but samples of the treatment of convergence in 
terms of infinite proximity. The method is not at all limited to elemen¬ 
tary topics. One can construct delta functions S with the property that 
f S — 1 and yet S(x) — 0 for x ^ 0. Original results in analysis 
(e.g., in the theory of Hilbert spaces) have been obtained by the method 
of nonstandard analysis. Possibly the method will be more widely used 
in the future as more analysts become familiar with it. 


Exercises 

1. (Q is dense in R.) Let Q be the set of rational numbers. Show that 
every member of *R is infinitely close to some member of *Q. 

2. (a) Let Act and F : A —> R. Then F is also a binary relation on 

R; show that *F : *A *R. 

(b) Let S : N — > R. Recall that S is said to converge to b iff for every 
e > 0 there is some k such that for all n > k, \S(n) — b\ < s. 
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Show that this is equivalent to the condition: *S(x) — b for every 
infinite x e *N. 

(c) Assume that S, : N —>■ R and S t converges to /?, for i = 1,2. 
Show that 5) + S 2 converges to b\ + b 2 and Si • S 2 converges to 
b 1 • b 2 . 

3. Let F : A -> R be one-to-one, where Act. Show that if x e *A 
but x A, then *F(x) R. 

4. Let A C R. Show that A = * A iff A is finite, 

5. (Bolzano-Weierstrass theorem) Let A c R be bounded and infinite. 
Show that there is a point p € R that is infinitely close to, but 
different from, some member of * A. Suggestion: Let S : N —> A 
with S one-to-one; look at *S(x) for infinite x e *N. 

6 . (a) Show that *Q has cardinality at least 2 K °, where Q is the set of 

rational numbers. Suggestion: Use Exercise 1. 

(b) Show that *N has cardinality at least 2^°. 

7. Let A be a subset of R having no greatest member. Then as a subset 
of *R, A will have upper bounds (with respect to the ordering *<) 
in *R. But show that A does not have a least such bound. 
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SECTION 3.0_ 

Number Theory 

In this chapter we will focus our attention on a 
specific language, the language of number theory. 
This will be the first-order language with equality 
and with the following parameters: 

V, intended to mean “for all natural numbers.” 
(Recall that the set N of natural numbers is the set 
{0, 1, 2,...}. Zero is natural.) 

0 , a constant symbol intended to denote the 
number 0. 

S, a one-place function symbol intended to de¬ 
note the successor function S : N —»■ N, i.e., the 
function for which S(n) = n + 1. 

<, a two-place predicate symbol intended to 
denote the usual (strict) ordering relation on N. 

+, •, E, two-place function symbols intended 
to denote the operations +, •, and E of addition, 
multiplication, and exponentiation, respectively. 

We will let 91 be the intended structure for this 
language. Thus we may informally write 

91= (N; 0, S, E). 

(More precisely, |9!| = N, O’ 75 = 0, and so forth.) 

By number theory we mean the theory of this 
structure, Th 91. As warmup exercises we will 
study (in Sections 3.1 and 3.2) certain reducts of 91, 
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i.e., restrictions of tit to sublanguages: 

m s = (N; 0, S ), 

= (N; 0, S, <), 
m A = (N; 0, S, <, +). 

Finally, in Section 3.8 we will consider 

= (N; 0, 5, <,+,■)• 

For each of these structures we will raise the same questions: 

(A) Is the theory of the structure decidable? If so, what is a nice set 
of axioms for the theory? Is there a finite set of axioms? 

(B) What subsets of N are definable in the structure? 

(C) What do the nonstandard models of the theory of the structure 
look like? (By “nonstandard” we mean “not isomorphic to the intended 
structure.”) 

Our reason for choosing number theory (rather than, say, group 
theory) for special study is this: We can show that a certain subtheory of 
number theory is an undecidable set of sentences. We will also be able to 
infer that any satisfiable theory that is at least as strong as this fragment 
of number theory (e.g., the full number theory or set theory) must be 
undecidable. In particular, such a theory cannot be both complete and 
axiomatizable. 

In order to show that our subtheory of number theory is undecid¬ 
able, we will show that it is strong enough to represent (in a sense 
to be made precise) facts about sequences of numbers, certain opera¬ 
tions on numbers, and ultimately facts about decision procedures. This 
last feature then lets us perform a diagonal argument that demonstrates 
undecidability. 

We could alternatively use, in place of a subtheory of number theory, 
some other theory (such as a fragment of the theory of finite sets) in 
which we could conveniently represent facts about decision procedures. 

Before giving examples of the expressiveness of the language of num¬ 
ber theory, it is convenient to introduce some notational conventions. 
As a concession to everyday usage, we will write 

x < y, x + y, x-y, and xEy 

in place of the official 

<xy, +xy, -xy, and E xy. 

For each natural number k we have a term S^O (the numeral for k) that 
denotes it: 

S°0 = 0, S‘0 = S0, S 2 0 = SS0, etc. 

(The set of numerals is generated from {0} by the operation of 
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prefixing S.) The fact that every natural number can be named in the 
language will be a useful feature. 

Even though only countably many relations on N are definable in 01, 
almost all the familiar relations are definable. For example, the set of 
primes is defined in 01 by 


V\ ^ S'o A V V2 Vu3(ui = V2 • Vj —> V2 — S'O V V3 — S'O). 

Later we will find it important to show that many other specific relations 
are definable in 01 . 

One naturally expects the expressiveness of the language to be 
severely restricted when some of the parameters are omitted. For exam¬ 
ple, the set of primes, as we shall see, is not definable in 01 4 . On the 
other hand, in Section 3.8 we will show that any relation definable in 01 
is also definable in 01 m . 

Preview 

The main theorems of this chapter — the theorems associated with the 
names of Godel, Tarski, and Church — are proved in Section 3.5. But 
we can already sketch here some of the ideas involved. We want to 
compare the concepts of truth and proof ; that is, we want to compare 
the set of sentences true in 01 with the set of sentences that might be 
provable from an appropriate set A of axioms. 

We can assign to each formula a of the language of number theory 
an integer fa, called the Godel number of a. Any sufficiently straight¬ 
forward way of assigning distinct integers to formulas would suffice 
for our purposes; a particular assignment is adopted at the beginning of 
Section 3.4. What is important is that from a we can effectively find 
the number fa, and conversely. Similarly, to each finite sequence D of 
formulas (such as a deduction) we assign an integer Q(D). Note that for 
any set A of formulas, we can form the corresponding set {fa a e A) 
of numbers. 

There are now three ways in which to proceed: the self-reference ap¬ 
proach, the diagonalization approach, and the computability approach. 
It will be argued later, however, that the three approaches are more 
closely related than they appear — they are three aspects of one ap¬ 
proach. 

First, in the self-reference approach, we make a sentence a that can 
be thought of as saying, “I am unprovable.” More specifically, we have 
the following: 

Theorem 30A Let A C Th01 be a set of sentences true in 01, and 
assume that the set {fa | a e A} of Godel numbers of members 
of A is a set definable in 01. Then we can find a sentence a such 
that o is true in 01 but cr is not deducible from A. 
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Proof. We will construct a to express (in an indirect way) that a 
itself is not a theorem of A. Then the argument will go roughly as 
follows: If A h a, then what a says is false, contradicting the fact 
that A consists of true sentences. And so AY- a, whence a is true. 

To construct a , we begin by considering the ternary relation R 
defined by 

(a, b, c) e R iff a is the Godel number of some formula 
a and c is the value of Q at some 
deduction from A of oTS^O). 

Then because {fla | a e A} is definable in 'It, it follows that R 
is definable also. (The details of this step must wait until later 
sections.) Let p be a formula that defines R in 01. Let q be the 
Godel number of 

V V 3 VI, Vs). 

(We use here the notation: cp(t) = q>, <p(t \, to) — (< Pti')t "L and 
so forth.) Then let a be 

Vv3-.p(S«0,S ? 0, vs). 

Thus a says that no number is the value of Q at a deduction from 
A of the result of replacing, in formula number q, the variable 
V\ by the numeral for q\ i.e., no number is the value of Q at a 
deduction of a. 

Suppose that, contrary to our expectations, there is a deduc¬ 
tion of cr from A. Let k be the value of Q at a deduction. Then 
(q,q,k) e R and hence 

|= m p(S 9 0,S ? 0,S t 0). 

It is clear that 

a h ^p(S‘ ? 0, S^O.S^O) 

and the two displayed lines tell us that a is false in 01. But Aha 
and the members of A are true in 01, so we have a contradiction. 

Hence there is no deduction of cr from A. And so for every k, 
we have (q,q,k) £ R. Thus for every k 

Hn-pfS^O, S«0,S*0), 

from which it follows (with the help of the substitution lemma) 
that 

NnV^p(S"0,S‘'0,^); 

i.e., a is true in 01. H 

We will argue later — using something called Church’s thesis — that 
any decidable set of natural numbers must be definable in 01. The con¬ 
clusion will then be that Th 01 is not axiomatizable. 
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Corollary 30B The set {JJr | [=91 r} of Godel numbers of sentences 
true in (R is a set that is not definable in 'It. 

Proof. If this set were definable, we could take A — Th (R in the 
preceding theorem to obtain a contradiction. H 

Section 3.5 will follow the self-reference approach, but with a varia¬ 
tion in which the sentence a tries to say, "I am false.” (The well-known 
liar paradox is relevant here!) 

But if this “self-reference” construction seems too much like a magic 
trick, there is a second way to describe the situation: the diagonalization 
approach, which does not use an obvious self-reference. 

We start by defining the following binary relation P on the natural 
numbers: 

(a,b) e P •<=> a is the Godel number of a formula a(v i) 

(with just V\ free) and |=<n a(S , ’0). 

(More informally, (a, b) e P <£> “a is true of bP) Then any set of natural 
numbers that is definable in fR equals, for some a, the “vertical section” 

P a = {b | (a,b) e P} 

of P. Namely, we take a to be the Godel number of a formula defining 
the set, and use the fact that |= 9 j a(S fo 0) <£> |= 9 i cy(V i) [[Z? ]] . 

So any definable (in (R) set of natural numbers is somewhere on the 
list Pi, P 2 ,.... Now we “diagonalize out” of the list. Define the set: 

H = {b | {b, b) $ P }. 

(More informally, b e H <£> “b is not true of hP) Then H is nowhere 
on the list Pi, P 2 ,....(// ^ P 3 because 3 e H <£> 3 ^ P 3 , so the 
number 3 belongs to exactly one of these two sets and not to the other.) 
Therefore H is not definable in 'Jt. 

Why is H undefinable? After all, we have above specified that 

b e H •<=> not [b is not the Godel number of a formula o/(v\) 
(withjust V\ free) and 1=91 ajS^O)]. 

What is the barrier to translating this specification into the language of 
arithmetic? We will show that the barrier is not being the Godel number 
of a formula — we can translate that — and the barrier is not having V\ 
free and not substituting the numeral S*0 into a formula. By the process 
of elimination, we will show that the only possible barrier is saying of 
a sentence that is true in 'Jl. 

Theorem 30C (a) The set {tJr | 1 = 9 ;r} of Godel numbers of sen¬ 

tences true in (R is not definable in (R. 

*(b) The theory Th (R is undecidable. 

*(c) The theory Th 'Ji is not axiomatizable. 
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Proof. Part (a), which is the same as Corollary 30B in the self- 
reference approach, has the diagonal proof sketched above. That 
is, if to the contrary Th 93 were definable in 91, then the above set 
H would also be definable, which it is not. 

Part (b) will then follow, after we argue every decidable set 
of natural numbers must be definable in 91. If Th 91 were decid¬ 
able, then the corresponding set {$r | 1=^ r} of numbers would 
be decidable and hence definable in 91, which it is not. 

And part (c) is an immediate consequence of part (b) and 
Corollary 261, since Th 91 is a complete theory. H 

And thirdly, the computability approach presents us with a stark 
difference between what is true and what is provable. From Section 2.6 
we know that whenever A is a decidable set (or even an effectively 
enumerable set) of axioms we might choose for Th 91, the set Cn A of 
provable sentence will be an effectively enumerable set. 

In contrast, the computability approach will show — using Church’s 
thesis — that the set Th 91 of all true sentences is not effectively enu¬ 
merable. This fact, which is closely related to Theorem 30C, will follow 
from another diagonal argument in Section 3.6. 

*T heorem 30D For any decidable (or even effectively enumerable) 
set A of axioms, 


Cn A ^ Th91 

because the set on the left is effectively enumerable and the set 
on the right is not. 

Theorem 30D presents the dilemma: Either the axioms are lying to 
us by allowing us to deduce false sentences, or else the axioms are 
incomplete, in the sense that some true sentences cannot be deduced 
from those axioms. 

This computability approach is implicit in parts of Section 3.5, but 
it is in Section 3.6 that the approach explicitly appears, and where it is 
compared with the other two approaches. 


SECTION 3.1 _ 

Natural Numbers with Successor 

We begin with a situation that is simple enough to let us give reasonably 
complete answers to our questions. We reduce the set of parameters 
to just V, 0, and S, eliminating <, +, •, and E. The corresponding 
reduct of 91 is 


91 s = (N; 0, S). 
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In this restricted language we still have the numerals, naming each point 
in N. But the sentences we can express in the language are, from the 
viewpoint of arithmetic, uninteresting. 

We want to ask about 91 $ the same questions that interest us in the 
case of 91. We want to know about the complexity of the set Th 91$; we 
want to study definability in 91^; and we want to survey the nonstandard 
models of 91$. 

To study the theory of the natural numbers with successor (Th 91$), 
we begin by listing a few of its members, i.e., sentences true in 91s. 
(These sentences will ultimately provide an axiomatization for the 
theory.) 

51. V.r Si ^ 0, a sentence asserting that zero has no predecessor. 

52. Vx V y(Sx = Sy —» x = y). This asserts that the successor 
function is one-to-one. 

53. V y(y ^ 0—>3 x y — Sx). This asserts that any nonzero number 
is the successor of something. 

54.1 VxSx^x. 

54.2 VxSSx^x. 

S4.« Vx S"x yi x, where the superscript n indicates that the symbol 
S occurs at n consecutive places. 

Let A s be the set consisting of the above sentences SI, S2, S3, S4.n 
(n — 1,2,...). Clearly these sentences are true in 91$; i.e., 91$ is a 
model of Aj. Hence 


Cn As c Th91 s . 

(Anything true in every model of As is true in this model.) What is 
not so obvious is that equality holds. We will prove this by considering 
arbitrary models of Aj. 

What can be said of an arbitrary model 

21 = (|2l|; 0 21 , S a ) 

of the axioms As? S 21 must be a one-to-one map of |2l| onto |2l| — {0 21 }, 
by SI, S2, and S3. And by S4.«, there can be no loops of size n. Thus 
|2l| must contain the “standard” points: 

0 21 -> S 21 ^ 21 ) -> S 2l (S 2t (0 21 )) -*..., 

which are all distinct. The arrow here indicates the action of S 21 . There 
may or may not be other points. If there is another point a in |2l|, then 
there will be the successor of a, its successor, etc. Not only that, but 
since (by S3) each nonzero element has a predecessor (something of 
which it is the successor) which is (by S2) unique, |2l| must contain 
the predecessor of a, its predecessor, etc. These must all be distinct lest 
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there be a finite loop. Thus a belongs to a “Z-chain”: 

a -> S a (a) S a (S a (a)) 

(We refer to these as Z-chains because they are arranged like the set 
Z of all integers {..., — 1, 0, 1, 2, ...}.) There can be any number of 
Z-chains. But any two Z-chains must be disjoint, as S2 prohibits merg¬ 
ing. Similarly, any Z-chain must be disjoint from the standard part. 

This can be restated in another way. Say that two points a and b in |2t| 
are equivalent if the function S a can be applied a finite number of times 
to one point to yield the other point. This is an equivalence relation. (It 
is clearly reflexive and symmetric; the transitivity follows from the fact 
that S' 1 is one-to-one.) The standard part of 21 is the equivalence class 
containing 0 a . For any other point (if any) a in |2t|, the equivalence class 
of a is the set generated from {a} by S a and its inverse. This equivalence 
class is the Z-chain described above. 

Conversely, any structure 03 (for this language) that has a stan¬ 
dard part 

o® -» s®(0®) S*(S*(0®)) -> • • • 


and a nonstandard part consisting of any number of separate Z-chains 
is a model of A$. (Check through the list of axioms in As, and note that 
each is true in 03.) We thus have a complete characterization of what 
the models of As must look like. 

If a model 21 of As has only countably many Z-chains, then |2t| 
is countable. In general, if the set of Z-chains has cardinality 1 X, then 
altogether the number of points in 12l| is X ( > + K () ■ /,. By facts of cardinal 
arithmetic (cf. Chapter 0) this number is the larger of Ko and X. Hence 

d f Ko if 21 has countably many Z-chains, 

[ X if 21 has an uncountable number X of Z-chains. 

Lemma 31A If 21 and 21' are models of As having the same number 
of Z-chains, then they are isomorphic. 

Proof. There is a unique isomorphism between the standard part 
of 21 and the standard part of 2f. By hypothesis we are given a 
one-to-one correspondence between the set of Z-chains of 21 and 
the set of Z-chains of 21'; thus each chain of 21 is paired with a chain 
of 21'. Clearly any two Z-chains are isomorphic. By combining 
all the individual isomorphisms (which uses the axiom of choice) 
we have an isomorphism of 21 onto 2l\ H 

Thus a model of As is determined to within isomorphism by its num¬ 
ber of Z-chains. For 91s this number is zero, but any number is possible. 


1 To avoid uncountable cardinals, see Exercise 3. 
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The reader should note that there is no sentence of the language which 
says, “There are no Z-chains.’’ In fact, there is no set E of sentences 
such that a model 21 of As satisfies E iff 21 has no Z-chains. For by the 
LST theorem there is an uncountable structure 21 with 21 = Dlj. But 21 
has uncountably many Z-chains and IBs has none. 

Theorem 31 B Let 21 and 03 be uncountable models of As of the 
same cardinality. Then 21 is isomorphic to 03. 

Proof. By the above discussion, 21 has card 21 Z-chains, and 03 
has card 03 Z-chains. Since card 21 = card '13. they have the same 
number of Z-chains and hence are isomorphic. H 

Theorem 31C Cn As is a complete theory. 

Proof. Apply the Los-Vaught test of Section 2.6. The preceding 
theorem asserts that the theory Cn As is categorical in any un¬ 
countable power. Furthermore, As has no finite models. Hence 
the Los-Vaught test applies. H 

Corollary 31D Cn As = Th 03s. 

Proof. We have Cn A s C Th 'Jt v : the first theory is complete and 
the second is satisfiable. H 

*Corollary 31E Th(Bs is decidable. 

Proof. Any complete and axiomatizable theory is decidable (by 
Corollary 25G). As is a decidable set of axioms for this theory. 

H 


Elimination of Quantifiers 

Once one knows a theory to be decidable, it is tempting to try to find a 
realistically practical decision procedure. We will give such a procedure 
for ThDIs, based on “elimination of quantifiers.” 

Definition. A theory T admits elimination of quantifiers iff for 
every formula (p there is a quantifier-free formula i [r such that 

T \= (<p •*-»■ x(r). 

Actually it is enough to consider only formulas i/r of a rather special 
form: 

Theorem 31F Assume that for every formula q> of the form 
3 x(ao A • • • A a„), 

where each a, is an atomic formula or the negation of an atomic 
formula, there is a quantifier-free formula i/f such that T (= 
(cp <—> i/f). Then T admits elimination of quantifiers. 
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Proof. First we claim that we can find a quantifier-free equivalent 
for any formula of the form 3x6 for quantifier-free 6 . We begin 
by putting 6 into disjunctive normal form (Corollary 15C). The 
resulting formula, 

3i[(«o A • • • A ot m ) V (A) A • • • A A,) V • • • V (§ 0 A • • • A &)], 
is logically equivalent to 

3 .r(cro A • • • A a m ) V 3 x(/3o A • • • A /3„) V • • • 

V 3 x(f 0 A • • • A $,). 

By assumption, each disjunct of this formula can be replaced by 
a quantifier-free formula. 

We leave it to the reader to show (in Exercise 2) that by using 
the above paragraph one can obtain a quantifier-free equivalent 
for an arbitrary formula. H 

In the special case where the theory in question is the theory Th 21 of 
a structure 21, the definition can be restated: Th 21 admits elimination of 
quantifiers iff for every formula <p there is a quantifier-free formula i fr 
such that <p and are “equivalent in 21”; i.e., 

Na (<P <-> 

for any map .v of the variables into |2l|. 

Theorem 31G Th 91$ admits elimination of quantifiers. 

Proof. By the preceding theorem, it suffices to consider a formula 
3 .v(ao A • • • A a q ), 

where each a, is atomic or is the negation of an atomic formula. 
We will describe how to replace this formula by another that is 
quantifier-free. The equivalence of the new formula to the given 
one will, in fact, be a consequence of A$; see Exercise 3. 

In the language of Ttj the only terms are of the form S k u, where 
u is 0 or a variable. The only atomic formulas are equations. We 
may suppose that the variable x occurs in each a,-. For if x does 
not occur in a, then 

3 x(a A P) |==| a A 3 x 

Thus each a,- has the form 

S m x = S" u 

or the negation of this equation, where u is 0 or a variable. We 
may further suppose u is different from .r, since S'"x = S"x could 
be replaced by 0 = 0 if m = n, and by 0 ^ 0 if m ^ n. 
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Case 1: Each a,- is the negation of an equation. Then the formula 
may be replaced by 0 = 0. (Why?) 

Case 2: There is at least one a, not negated; say ao is 

S'”* = f, 

where the term t does not contain x. Since the solution for * must 
be non-negative, we replace ao by 

Af#S m_1 0 

(or by 0 = 0 if m — 0). Then in each other we replace, say, 
S k .x — u 

first by 

S k+m x = S m u, 

which in turn becomes 

S k t — S m u. 

We now have a formula in which x no longer occurs, so the quan¬ 
tifier may be omitted. H 

There are several interesting by-products of the quantifier elimi¬ 
nation procedure. For one, we get an alternative proof of the com¬ 
pleteness of CnAj. Suppose we begin with a sentence a. The 
quantifier elimination procedure gives a quantifier-free sentence r such 
that (by Exercise 3) As |= (a o- r). Now we claim that either As |= r 
or As |= -it. For r is built up from atomic sentences by means of —i 
and —k An atomic sentence must be of the form S A 0 = S ; 0 and is de- 
ducible from As if k — l, but is refutable (i.e., its negation is deducible) 
from As if k ^ l. (In fact, just {SI, S2} suffices for this.) Since every 
atomic sentence can be deduced or refuted, so can every quantifier- 
free sentence. This establishes the claim. And so either As |= a or 
As 1= ^ o'- 

Another by-product concerns the problem of definability in 91s; see 
Exercises 4 and 5. For any formula (p in which just V\ and v 2 occur free 
we now can find a quantifier-free i// (with the same variables free) such 
that 


i.e., 


ThOls |= V V\ V v 2 (<p i/r); 


Ntn s V vi V v 2 {q> \jr). 


Thus the relation <p defined is also definable by a quantifier-free formula. 
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Exercises 

1. Let A* s be the set of sentences consisting of S1, S2, and all sentences 
of the form 

<P( 0) —> V v\{(p{vi) —>• p(Si>i)) —>• V Vi <p{v i), 

where q> is a wff (in the language of 91s) in which no variable except 
V\ occurs free. Show that A s c Cn ,4J. Conclude that CnAJ = 
Th 91y- (Here (pit) is by definition (p] h . The sentence displayed above 
is called the induction axiom for <p.) 

2. Complete the proof of Theorem 3 IF. Suggestion: Use induction. 

3. The proof of quantifier elimination for Th91s showed how, given a 
formula q>, to find a quantifier-free i jr. Show that 

1 = (?><->• 

without using the completeness of Cn A s . (This yields an alternative 
proof of the completeness of Cn As, not involving Z-chains or the 
Los-Vaught test.) 

4. Show that a subset of N is definable in 9Ts iff either it is finite or its 
complement (in N) is finite. 

5. Show that the ordering relation {{m, n) \ m < n in NJ is not definable 
in 91s- Suggestion: It suffices to show there is no quantifier-free 
definition of ordering. Call a relation D C f)xN linear if it can 
be covered by a finite number of lines. Call R colinear if it is the 
complement of a linear relation. Show that any relation definable 
in 91s is either linear or colinear. And that the ordering relation is 
neither linear nor colinear. 

6. Show that Th 91s is not finitely axiomatizable. Suggestion: Show that 
no finite subset of A s suffices, and then apply Section 2.6. 


SECTION 3.2 _ 

Other Reducts of Number Theory 1 

First let us add the ordering symbol < to the language. The intended 
structure is 

91/, = (N; 0, S , <). 


1 This section may be omitted without disastrous effects. 
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We want to show that the theory of this structure is (like Th 91$) decidable 
and also admits elimination of quantifiers. But unlike Th 91s, it is finitely 
axiomatizable and is not categorical in any infinite cardinality. 

As axioms of Th9Iz. we will take the finite set Al consisting of the 
six sentences listed below. Here x < y is, of course, an abbreviation for 
(x < y V x = y), and x y abbreviates the negation of this formula. 


V y (y ^ 0 —> 3 xy — Sx) (S3) 

Vx V y (x < Sy <-*• x < y) (LI) 

Vx x^O (L2) 

VxVv (x<yVx = yVy<x) (L3) 

V x V y (x < y —* v ^ x) (L4) 

Vx Vy Vz (x < y —> y < z —* x < z) (L5) 


On the one hand, it is easy to see that all six axioms are true in 91/,. 
Thus Cn Ai C Th 91/. On the other hand, the opposite inclusion is not 
obvious, and requires proof. We begin by listing some consequence of 
these axioms. 

(1) A l F Vx x < Sx. 

Proof. In LI take y to be x. 3 

(2) A l h Vxx < x. 

Proof. In L4 take y to be x. H 

(3) A l PVxVy(r^y<->y<x) (trichotomy). 

Proof. For “—»•” use L3. For “*—” use L4 and (2). H 

(4) A l F Vx Vy(x < y ■*-> Sx < Sy). 

Proof. From Al we can deduce the biconditionals: 

x < y <-»■ y ^ x by (3); 

y Sx by LI; 

^ Sx < y by (3); 

Sx < Sy by LI. 

(5) A l F SI and A L F S2. 

Proof. SI follows from L2 and (1). S2 comes from (4) by use of 
L3 and (2). H 


(6) Al F S4.n for n = 1,2,_ 

Proof. This follows from (1) and (2), by using L5. 
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Thus any model 21 of Ai is (when we ignore < a ) also a model of 
A s . So it must consist of a standard part plus zero or more Z-chains. In 
addition, it is ordered by < a . 

Theorem 32A The theory Cn Ai admits elimination of quantifiers. 

Proof. Again we consider a formula 

3x(A> A • • • A P p ), 

where each ft, is atomic or the negation of an atomic formula. The 
terms are, as in Section 3.1, of the form S k u, where u is 0 or a 
variable. There are two possibilities for atomic formulas, 

S k u = S't and S k u<S l t. 

1. We can eliminate the negation symbol. Replace t\ ^ to by 
ti < h V t\ = t 2 and replace t\ ^ to by t\ < t 2 V ?2 < t\. (This is 
justified by L3 and L4.) By regrouping the atomic formulas and 
noting that 

3 x{tp A i/ r ) t==l 3 x tp A 3 x 
we may again reach formulas of the form 
3 x{aa A • • • A a q ), 
where now each a, is atomic. 

2. We may suppose that the variable x occurs in each a,. This 
is because if x does not occur in a, then 

3 x(a A ft) |==] a A 3 x ft. 

Furthermore, we may suppose that x occurs on only one side of 
the equality or inequality a,. For S k x — S'x can be dealt with as 
in Section 3.1. S k x < S 1 x can be replaced by 0 = 0 if k < /, and 
0 ^ 0 otherwise. (This is justified by LI and L4.) 

Case 1: Suppose that some a, is an equality. Then we can 
proceed as in case 2 of the quantifier-elimination proof of 
Theorem 31G. 

Case 2: Otherwise each a,- is an inequality. Then the formula 
can be rewritten 



(Here /\ ( indicates the conjunction of formulas indexed by i, so 
Vo A Y\ A • • • A Yk can be abbreviated /\ ( y ( -.) In the first con¬ 
junction, f\ i t, < S" !i x, we have the lower bounds on x; in the 
second conjunction, /\ . S"'x < u n we have the upper bounds. If 
the second conjunction is empty (i.e., if there are no upper bounds 
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on x), then we can replace the formula by 0 = 0. (Why?) If the 
first conjunction is empty (i.e., if there are no lower bounds on x ), 
then we can replace the formula by 

A S'"° < uj, 
i 

which asserts that zero satisfies the upper bounds. Otherwise, we 
rewrite the formula successively as 

3 x A (h < S m 'x A S nj x < uj). (1) 

‘.j 

3 x A (S"' U < S mi+n *x < S m ‘ uj). (2) 

i>j 

^A S"' +1 f/ < A A S"'0 < Uj. (3) 

This last formula says “any lower bound plus one satisfies any up¬ 
per bound, and furthermore zero satisfies any upper bound.” This 
implies that there is a gap between the greatest lower bound and 
the least upper bound, whence there is a solution for x . The second 
part guarantees that the solution for x is not forced to be negative. 

In each case, we have arrived at a quantifier-free version of the 
given formula. H 

Corollary 32B (a) Cn A L is complete, 

(b) Cn Ai — Th 91 l . 

*(c) Th 91/, is decidable. 

Proof, (a) The argument that followed the proof of Theorem 31G 
is applicable here also, (b) This follows from (a), since Cn A L C 
Th Ti/ and Th 91, is satisfiable. For (c), we can use the fact that 
any complete axiomatizable theory is decidable. But the quantifier 
elimination proof yields a more efficient decision procedure. H 

Corollary 32C A subset of N is definable in 91/ iff it is either 
finite or has finite complement. 

Proof. Compare Exercise 4 of the preceding section. H 

On the other hand, 91 l has more definable binary relations than has 
9tj. For the ordering relation { (m, n) \ m < n) is not definable in 91$, 
by Exercise 5 of the preceding section. 

Corollary 32D The addition relation, 

{(/77 , n, p) | m + n — p }, 
is not definable in 91 l • 



Chapter 3: Undecidability 


197 


Proof. If we could define addition, we could then define the set of 
even natural numbers. But this set is neither finite nor has finite 
complement. H 

Now suppose we augment the language by the addition symbol +. 
The intended structure is 

9U = (N; 0,5, <,+). 

The theory of this structure is also decidable, as we will prove shortly. 
But to keep matters from getting even more complicated, we will avoid 
listing any convenient set of axioms for the theory. 

The nonstandard models of TIi9Ia must also be models of Th 'Jl/. 
So they have a standard part, followed by some Z-chains. But ordering 
among the Z-chains can no longer be arbitrary. Let 21 be a nonstandard 
model of Th9t A . The ordering < SI induces a well-defined ordering on 
the set of Z-chains. (See Exercise 3.) We claim that there is no largest 
Z-chain, there is no smallest Z-chain, and there is between any two Z- 
chains another one. The reasons, in outline, can be stated simply: If a 
belongs to some Z-chain (i.e., is an infinite element of 21), then a +' 1 a 
is in a larger Z-chain. There must be some b such that b + 21 b is either 
a or its successor; b must be in a smaller Z-chain. If ci\ and a 2 belong to 
different Z-chains, then there must be some b such that b + 21 b is either 
a\ + 21 a 2 or its successor. And b will lie in a Z-chain between that of 
a 1 and that of « 2 . (These statements should seem quite plausible. The 
reader who enjoys working with infinite numbers might supply some 
details.) 

*Theorem 32E (Presburger, 1929) The theory of the structure 

9!a = (N; 0, S, <, +) is decidable. 

The proof is again based on a quantifier elimination procedure. 
The theory of 91 a itself does not admit elimination of quantifiers. 
For example, the formula defining the set of even numbers 

3 y v x = y + y 

is not equivalent to any quantifier-free formula. We can overcome 
this by adding a new symbol = 2 for congruence modulo 2. Sim¬ 
ilarly, we add symbols = 3 , =4 .The intended structure for 

this expanded language is 

9T = (N; 0, S, <, +, = 2 , = 3 ,...), 

where =*, is the binary relation of congruence modulo k. It turns 
out that the theory of this structure does admit elimination of 
quantifiers. 

This by itself does not imply that the theory of either structure 
is decidable. After all, we can start with any structure, and expand 
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it to a structure having additional relations until a structure is ob¬ 
tained that admits elimination of quantifiers. To obtain decidabil¬ 
ity, we must show that we can, given a sentence er, ( 1 ) effectively 
find a quantifier-free equivalent o', and then ( 2 ) effectively decide 
if o' is true. 

We will now give the quantifier elimination procedure for 
Th9I = . For a term t and a natural number n, let nt be the term 
t + t + ■ • ■ + t, with n summands. 0 1 is 0. Then any term can be 
expanded to one of the form 

S"°0 + n\X\ H-b n k x k 

for k > 0, rij > 0 (where the x,’s are variables). For example, 
S(x + SO) + S y 

becomes 

S 3 0 + x + y. 

As usual we begin with a formula 3 y(/3 1 A • • • A fi n ), where 
P/ is an atomic formula or the negation of one. 

1. Eliminate negation. Replace —i(fi = h) by (t\ <_to V *2 < 1\ ). 
Replace —i(fi < ti) by (fi = f 2 V ?2 < t\). And replace — \{t\ =,„ ti) 
by 

t\ = m h + S 1 0 V • • • V t\ = m ti + S'" *0. 

Then regroup into a disjunction of formulas of the form 
3 y(ai A • • • A a m ), 

where each a ,• is atomic. We may further suppose, as before, that 
y occurs in each a,-, and in fact that a, has one of the four forms 

ny + t = u, 
ny + t = m u, 
ny + t < u, 

u < ny + t, 

where u and t are terms not containing y. In what follows we 
will take the liberty of writing these formulas with a subtraction 
symbol: 


ny = u — t, 
ny = m u — t, 
ny < u — t, 
u — t < ny. 


These are merely abbreviations for the formulas without subtrac¬ 
tion obtained by transposing terms. 
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For example, we might have at this point the formula 
3 y(w < 4y A 2y < u A 3y < v A y =3 f), 

where t, u, v, and w are terms not containing y. 

2. Uniformize the coefficients of y. Let p be the least common 
multiple of the coefficients of y. Each atomic formula can be 
converted to one in which the coefficient of y is p, by “multiplying 
through” by the appropriate factor. This is obviously legitimate 
for equalities and inequalities. In the case of congruences one 
must remember to raise the modulus also: 

a =„, b iff ka =^,„ kb. 

In the example above p is 12, and we obtain 

3 y(3u> < 12y A 12y < 6u A 12y < 4v A 12y = 3 ^ 12t). 

3. Eliminate the coefficient of y. Replace py by x and add the 
new conjunct x = p 0. (In place of 3 y • • • 12y • • • we can equally 
well have, “There exists a multiple x of 12 such that ■ ■ ■ x ■ ■ ■.”) 
Our example is now converted to 

3 jc(3 w < x A x < 6u A x < 4v A x =36 12 1 A x =12 0). 

4. Special case. If one of the atomic formulas is an equality, 
x + t — u, then we can replace 

3x6 

by 

K-t A f < M- 


Here replacement of x by “u — t” is the natural thing; we transpose 
terms to compensate for the absence of subtraction. For example, 

(.r u) M _£ is u v f- 

5. We may assume henceforth that = does not occur. So we 
have a formula of the form 

3 x[ro — so < x A • • • A r/_ 1 — s/_ 1 < x 
A x < to — Mo A • • • A x < 4-1 — Kjfc-i 
A x = mo v 0 - w 0 A • • • A .v v„-i - w„-i], 

where r,, Sj , , w,, v,-, and w, are terms not containing x. This can 
be abbreviated 


3 x 


/\ 0 - Sj < x A /\x < tj - ui A x = mj Vi - Wj 
j<l 


i<k 


If there are no congruences (i.e., n = 0), then the formula as¬ 
serts that there is a nonnegative space between the lower and 
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upper bounds. We can replace the formula by the quantifier-free 
formula: 

A A ~ s j' ) + SO < h ~ u i A A 0 < ~ M/ - 

i<kj <1 i<k 

Let M be the least common multiple of the moduli mo,..., 
m„_i. Then a + M = mi a. So as a increases, the pattern of residues 
of a modulo mo, ..., m„_i has period M. Thus, in searching for a 
solution to the congruences, we need only search M consecutive 
integers. 

We now have a formula that asserts the existence of a natural 
number which is not less than certain lower bounds L\, ..., Lj 
and which satisfies certain upper bounds and certain congruences. 
If there is such a solution, then one of the following is a solution: 

Li, Li + 1, ...,Li + M - 1, 

l 2 ,l 2 + i,...,l 2 + m- 1, 

L h L,+ l,...,L, + M- 1, 

0, 1,..., M — 1. 

(The last line is needed to cover the case in which every Lj is 
negative. To avoid having to treat this line as a special case, we 
will add a new lower bound of 0. That is, let r/ = 0 and s/ = SO 
so that 

n - si < x 

is a formula 0 < x + SO asserting that x is nonnegative. We now 
have / + 1 lower bounds.) 

Our formula (asserting the existence of a solution for x) can 
now be replaced by a quantifier-free disjunction that asserts that 
one of the numbers in the above matrix is a nonnegative solution: 

V V A n ~ s ‘ < {r i ~ s j' ) + S<?0 

j<I 1 <q<M Li <1 

A f\(rj - Sj ) + SA) < ti - Ui 

i<k 

A f\(rj - Sj)+S q 0= mi vi - wi 

i<n 

In our continuing example we have, after adding the new lower 
bound on x, 

3 x(3 w < x A 0 < .v + SO A x < 6u A x < 4i> 

A x =36 12r A x =i 2 0). 

The quantifier-free equivalent is a disjunction of 72 conjunctions. 
Each conjunction has six constituents. 
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This proves half of the theorem. If we are given a sentence a, 
the above procedure tells us how to find effectively a quantifier- 
free sentence r (in the language of < Jt = ) that is true (in the intended 
structure) iff a is. Now we must decide if r is true. 

But this is easy. It is enough to look at atomic sentences. Any 
variable-free term can be put in the form S"0. Then, for example, 

S"0 =„, S'’0 


is true iff n = m p. ~\ 

Thus we have a decision procedure for Til hl 4 . In 1974 Michael 
Fischer and Michael Rabin showed, however, that there is no decision 
procedure that is fast enough to be feasible for very long formulas. 

A set D of natural numbers is said to be periodic if for some positive 
p, any number n is in I) iff n + p is in D. D is eventually periodic iff 
there exist positive numbers M and p such that for all n greater than M , 
n e D iff n + p e D. 

Theorem 32F A set of natural numbers is definable in (N; 0, 5, 
<, +) iff it is eventually periodic. 

Proof. Exercise 1 asserts that every eventually periodic set is de¬ 
finable. Conversely, suppose D is definable. Then D is defin¬ 
able in 91 = by a quantifier-free formula (whose only variable is 
V ]). Since the class of eventually periodic sets is closed under 
union, intersection, and complementation, it suffices to show that 
every atomic formula in the language of 91 = whose only vari¬ 
able is v i defines an eventually periodic set. There are only four 
possibilities: 

nv\ + t = u, 
nv\ + t < u , 

u < nv i + t, 
nv i + t = m u , 

where u and t are numerals. The first two formulas define finite 
sets (which eventually have period 1), the third defines a set with 
finite complement, and the last formula defines a periodic set with 
period m. H 

Corollary 32G The multiplication relation 
{(/«, n, p) | p = m ■ n in N) 
is not definable in (N; 0, S, <, +). 

Proof. If we had a definition of multiplication, we could then use 
that to define the set of squares. But the set of squares is not 
eventually periodic. H 
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Exercises 

1. Show that any eventually periodic set of natural numbers is definable 
in the structure 91 a . 

2. Show that in the structure (N; +) the following relations are 
definable: 

(a) Ordering, {{m, n) \ m < «}. 

(b) Zero, {0}. 

(c) Successor, {(/«, n) \ n = S(m)}. 

3. Let 21 be a model of Th 91/. (or equivalently a model of Ai). For a 
and b in 21 define the equivalence relation: 

a ~ b ■<=> S sl can be applied a finite number of times to one 
of a , b to reach the other. 

Let |a] be the equivalence class to which a belongs. Order equiva¬ 
lence classes by 

[a] -< [£>] iff a < 21 b and a 'f b . 

Show that this is a well-defined ordering on the set of equivalence 
classes. 

4. Show that the theory of the real numbers with its usual ordering, 
Th(K.; <), admits elimination of quantifiers. (Assume that the lan¬ 
guage includes equality.) 


SECTION 3.3 _ 

A Subtheory of Number Theory 

We now return to the full language of number theory, as described in 
Section 3.0. The parameters of the language are V, 0, S, <, +, •, and 
E. The intended structure for this language is 

92= (N; 0, S, E). 

Actually in (N; •, E) wecandefine {0}, S, <, and+. (See Exercise 1.) 
As we will show in Section 3.8, in (N; +, •) we can define E as well as 
0, S, and <. So there are ways in which we could economize. The luxury 
of having all these parameters (particularly E) will simplify some of the 
proofs. 

As we shall see, Th 9! is a very strong theory and is neither decidable 
nor axiomatizable. In order to prove this fact (and a number of related 
results), it will be strategically wise to select for study a finitely axiom¬ 
atizable subtheory of Th9L As hinted at in Section 3.0, this subtheory 
should be strong enough to represent (in a sense to be made precise) 
facts about decidable sets. The subtheory we have selected is Cn Ae, 



Chapter 3: Undecidability 


203 


where A E is the set consisting of the eleven sentences listed below. (As 
in the preceding section, x < y abbreviates x < v V A' = y.) 

Set A e of Axioms 


Vx 

Sx^O 

(SI) 

V x V y 

(Sx = Sy —* x = v) 

(S2) 

Vx Vy 

(x < S v •*-» x < y) 

(LI) 

Vx 

A 

o 

(L2) 

V x V y 

(x <yVx = yVy<x) 

(L3) 

Vx 

x + 0 = x 

(Al) 

V x V y 

x + Sy = S(x + y) 

(A2) 

Vx 

o 

II 

o 

(Ml) 

Vx Vy 

x • Sy = x • y + x 

(M2) 

Vx 

xEO = SO 

(El) 

Vx Vy 

x E Sy = x E y • x 

(E2) 


Since Ot is a model of A E , we have Cn A E C Th fR. But (as we will 
prove in Section 3.5) equality does not hold here. In fact, it can be shown 
that A e Y- S3, where S3 is the sentence V y(y ^0 —»• 3 x y = Sx). 

The first five axioms give us some, but not all, of the axioms regard¬ 
ing S and < that were useful in the preceding sections. The other six 
axioms are the “recursion” equations for addition, multiplication, and 
exponentiation. 

We first show that certain simple sentences in Th 'Jt are deducible 
from A E . 

Lemma 33A (a) A E hV.ri^ 0. 

(b) For any natural number k, 

A e h Vx(x < S A ' +1 0 «-> x = S°0 V • • • V x = S A 0). 

Notice that (a) can be thought of as the k — — 1 case of (b), where 
the empty disjunction is _L. The lemma tells us that A E “knows” that 
the numbers less than 7, for example, are exactly 0, 1, 2, 3, 4, 5, 6. So in 
any model of A E , the standard points — the ones denoted by numerals 
S k 0 — are ordered in the natural way, and (by L3) the infinite points, if 
any, are all larger than any standard point. 

Proof. Part (a) is L2. For (b) we use induction (in English) on k. 
We have as a consequence of L1, 

x < SO x < 0 V x = 0, 
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which together with L2 gives 

x < SO x = 0, 

which is the k = 0 case of (b). For the inductive step we again 
apply L 1 : 

x < S A+I 0 «-► x < S A 0 V x = S A 0. 

By the inductive hypothesis, x < S A 0 can be replaced by 
x = S°0 V • • • V x = S A-1 0, 

whereby we obtain (b). H 

Lemma 33B For any variable-free term f, there is a unique natural 
number n such that 


A E \- t — S"0. 

Proof. The uniqueness is immediate. (Why? Because A E , weak 
as it is, at least knows, by SI, that 7^0, and by S2 80 times, 
that 87 / 80.) For the existence half, we use induction on t. If t 
is 0, we take n — 0. If t is Sm, then by the inductive hypothesis 
A e F u = S'"0 for some m. Hence A E F t — S'” + 1 0. 

Now suppose t is u\ + « 2 . By the inductive hypothesis 
A e h t — S'"0 + S"0 for some m and n. We now apply A2 n 
times and A1 once to obtain A E h r — S'" + "0. The arguments for 
multiplication and exponentiation are similar. H 

As a special case of this lemma we have “2 + 2 = 4” (i.e., S 2 0 + 
S 2 0 = S 4 0) as a consequence of A E . A E is at least smart enough to 
evaluate variable-free terms. And the proof shows more than this. The 
proof provides exact instructions for how, given such a term t, to find 
effectively the unique number n such that A E I - t — S"0. 

Theorem 33C For any quantifier- free sentence r true in fit, A E F r. 

Proof. Exercise 2. Start with the atomic sentences; these will be of 
the form t\ — t 2 or t\ < ti for variable-free terms t\ and O• Show 
that A e proves r if r is true in 0T, and refutes r (i.e., proves —i r) 
if r is false in 91 H 

Later on, we will improve on Theorem 33C by allowing r to contain 
“bounded quantifiers”; see Theorem 331. 

A simplified notation (used earlier in Section 2.7) for substitution 
will be helpful in the coming pages: 

(pit) = 

<P(h,t2) = 

and so forth. Thus (p = (p{v\) — <p(v \, V 2 ). Usually the term substituted 
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will be a numeral, for example 

</°(S"0, S fc 0) = (<$„)£„. 

But at times we will also substitute other terms, e.g., tp (x ) = (p '"', where 
x is a variable. If, however, x is not substitutable for v\ in <p, then we 
must take <p(x) — 1 jr* 1 , where ifr is a suitable alphabetic variant of <p. 

In the next proof (and elsewhere in this chapter) we make use of 
the following consequence of the substitution lemma of Section 2.5: 
For a formula (p in which at most V\,... ,v n occur free and for natural 
numbers a\,... ,a n , 

N<n • • •, Nm <P(S fll 0,..., S fl "0). 

An existential (3 1 ) formula is one of the form 3 x\ • • • 3x^9, where 
0 is quantifier-free. The following result improves Theorem 33C: 

Corollary 33D If r is an existential sentence true in OL then 
A e F r. 

Proof. If 3 v\ 3 in 6 is true in Tt, then for some natural numbers 
m and n, 9( S'"0, S"0) is true in 01. As this is a quantifier-free true 
sentence, it is deducible from A E . But it in turn logically implies 
3 v\ 3 V 2 9. H 

On the other hand, it is known that there are true universal (Vi) 
sentences (i.e., of the form Vxi • • • Vx&0 for quantifier-free 9) that are 
not in Cn A E . 

Representable Relations 

Let R be an m -ary relation on N; i.e., R C N m . We know that a formula 
p (in which only V \,..., v m occur free) defines R in 9T iff for every 
,..., a m in N, 

(r/i,. . . , Gm) € R • • • 5 

h m p(S fll 0 , ...,s fl '” 0 ). 

(The last condition here is equivalent to the preceding one by the sub¬ 
stitution lemma.) We can recast this into two implications: 

(a\,..., a m ) iR^^ p(S fll 0,..., S fl '»0). 

We will say that p also represents R in the theory C 11 A E if in these two 
implications the notion of truth in 01 can be replaced by the stronger 
notion of deducibility from A E . 

More generally, let T be any theory in a language with 0 and S. Then 
p represents R in T iff for every a 1 , ..., a,„ in N: 

(a u ■ ■ ■, a m ) e R p(S fll 0,..., S fl »0) e T, 

(a u ..., a m ) $ R =► ( -1 p(S fl >0,..., S fl -0)) e T. 
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For example, p represents R in the theory Th 'Jt iff p defines R in 'It. 
But p represents R in Cn A E iff for all , a m : 

(a u ...,a m ) e R^ A e h p(S fll 0,..., S fl "0), 

(a u ...,a m ) £R^A e b -ip(S fll 0,..., S fl »0). 

The equality relation on N, for example, is represented in Cn A E by the 
formula v\ — i> 2 . For 

m = n =>• F S'"0 = S"0, 
m = n=> {SI, S2) h -.S"'0 = S"0. 

A relation is representable in T iff there exists some formula that rep¬ 
resents it in T. 

The concept of representability should be compared with that of 
definability. In both cases we are somehow describing relations on the 
natural numbers by formulas. In the case of definability, we ask about 
the truth of sentences in the interpretation. In the case of representability 
in Cn A e , we ask instead about the deducibility of sentences from the 
axioms. 

Say that a formula <p, in which no variables other than v\,..., v m 
occur free, is numeralwise determined by A E iff for every m-tuple 
ai,... ,a m of natural numbers, either 

A e V- <p(S fll 0,..., S fl, "0) 
or 

A e h -Mp(S fl >0.S n '"0) 

Theorem 33E A formula p represents a relation R in Cn A E iff 

(1) p is numeralwise determined by A E , and 

(2) p defines R in 9T 

Proof. We use the fact that fR is a model of A E . If p represents R 
in Cn A e . then it is clear that (1) holds; (2) holds since “ A E h” 
implies Conversely, if (1) and (2) hold, then we have 

(«i,..., a m ) e => t P(S fll 0,..., S fl -»0) by (2) 

=>■ A e Y- -i p(S fll 0, ..., S fl '"0) since D3 is a model 

of A e 

^A £ hp(S" l fl,...,S"-0) by (1). 

Similarly for the complement of R and -ip. H 

Church's Thesis 

We now turn to the relationship of the concepts of representability and 
decidability. 

*Theorem 33F Assume that R is a relation representable in a con¬ 
sistent axiomatizable theory. Then R is decidable. 
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Proof. Say that p represents R in the consistent axiomatizable 
theory T. Recall that T is effectively enumerable (Corollary 25F). 
The decision procedure is as follows: 

Given a \, a m , enumerate the members of T. If, in the 
enumeration, p(S fll 0, ..., S' lm 0) is found, then we are done and 
(a i,..., a m ) e R. If, in the enumeration, -i p(S fll 0,..., S""'0) is 
found, then we are done and (a \,..., a,„) ^ R. 

By the representability, one sentence or the other always ap¬ 
pears eventually, whereupon the procedure terminates. Since T is 
consistent, the answer given by the procedure is correct. H 

*Corollary 33G Any relation representable in a consistent finitely 
axiomatizable theory is decidable. 

What about the converse to the above corollary? We cannot really 
hope to prove the converse on the basis of our informal notion of de¬ 
cidability. For our informal approach is usable only for giving lower 
bounds on the class of decidable relations (i.e., for showing that certain 
relations are decidable) and is unsuited to giving upper bounds (i.e., for 
showing undecidability). 

It is nevertheless possible to make plausibility arguments in support 
of the converse. This will be easier to do at the end of Section 3.4 than 
here. Roughly, the idea is that in a finite number of axioms we could 
capture the (finitely long) instructions for the decision procedure. 

The assertion that both the above corollary and its converse are cor¬ 
rect is generally known as Church’s thesis. This assertion is not really 
a mathematical statement susceptible to proof or disproof; rather it is a 
judgment that the correct formalization of the informal notion of decid¬ 
ability is by means of representability in consistent and finitely axiom¬ 
atizable theories. 

Definition. A relation R on the natural numbers is recursive iff it 
is representable in some consistent finitely axiomatizable theory 
(in a language with 0 and S). 

Church’s thesis now can be put more succinctly: A relation is de¬ 
cidable iff it is recursive. Or perhaps more accurately: The concept of 
recursiveness is the correct precise counterpart to the informal concept 
of decidability. The situation is analogous to one encountered in calcu¬ 
lus. An intuitively continuous function (defined on an interval) is one 
whose graph you can draw without lifting your pencil off the paper. But 
to prove theorems, some formal counterpart of this notion is needed. 
And so one gives the usual definition of e-5-continuity. One should ask 
if the precise notion of e -S -continuity is an accurate formalization of 
intuitive continuity. If anything, the class of e-5-continuous functions is 
too broad. It includes nowhere differentiable functions, whose graphs 
cannot be drawn without lifting the pencil. But accurate or not, the class 
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of s-8 -continuous functions has been found to be a natural and important 
class in mathematical analysis. 

Very much the same situation occurs with recursiveness. One should 
ask if the precise notion of recursiveness is an accurate formalization of 
the informal notion of decidability. Again, the precisely defined class 
(of recursive relations) appears to be, if anything, too broad. It includes 
relations for which any decision procedure will, for large inputs, require 
so much computing time and memory (“scratchpad”) space as to make 
implementation absurd. Recursiveness corresponds to decidability in 
an idealized world, where length of computation and amount of mem¬ 
ory are disregarded. But in any case, the class of recursive relations 
has been found to be a natural and important class in mathematical 
logic. 

Empirical evidence that the class of recursive relations is not too 
narrow is provided by the following: 

1. Any relation considered thus far that mathematicians have felt 
was decidable has been found to be recursive. 

2. Several people have tried giving precise definitions of idealized 
computing agents. The best-known such idealized agents are the “Turing 
machines,” introduced by Alan Turing in 1936. (A variation on that idea 
leads to the register machines described in Section 3.6.) The idea was 
to devise something that could carry out any effective procedure. In 
all cases, the class of relations having decision procedures executable 
by such a computing agent has been exactly the class of recursive 
relations. (Because of the importance of Turing’s analysis of effec¬ 
tive computability. Church’s thesis is often called the Church-Turing 
thesis.) 

The fact that so many different (yet equivalent) definitions for the 
class of recursive relations have been found is some indication of the 
naturalness and importance of the concept. 

In this book we will continue to exclude the informal notion of decid¬ 
ability from nonstarred theorems. But in the remainder of the exposition 
we will accept Church’s thesis. For example, we will speak of a set’s 
being undecidable when we have a theorem stating it to be nonrecursive. 

Obviously any relation representable in Cn A E is recursive. We will 
prove later that the converse also holds; if a relation is representable in 
any consistent finitely axiomatizable theory, then it is representable in 
the one theory we have selected for special study. (This was, of course, 
a motivating factor in our selection.) 

The use of the word “recursive” in this context is the result of histori¬ 
cal accident — even of historical error. Recently several mathematicians 
have argued that the word “computable” would more accurately reflect 
the intended ideas. But in the present context, we want to reserve the 
word “computable” for an informal concept, to be defined next. For 
relations we have the informal concept of decidability; for functions the 
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analogous concept is computability. (As notational shorthand, a string 
ci \,..., a k can be written as a.) 

’‘'Definition. A function f : —> N is computable iff there is an 

effective procedure that, given any A'-tuple a of natural numbers, 
will produce f(a). 

For example, addition and multiplication are computable. Effective 
procedures, using base-10 notation, for these functions are taught in the 
elementary schools. (Strictly speaking, in the concept of computabil¬ 
ity one should refer to being given numerals, not numbers. For it is 
numerals — strings of symbols like the triple 317 or the triple XCI — 
that can be communicated. Nonetheless, we will suppress this point.) On 
the other hand, of the uncountably many functions from into N, only 
countably many can be computable, because there are only countably 
many effective procedures. 

We want to give a mathematical counterpart to the informal concept 
of computability, just as in the case of decidable relations. The clue to 
the correct counterpart is provided by the next theorem. Recall that any 
function / : N* —> N is also a (k + l)-ary relation on N: 

{ai,..., a k , b) e / - a k )=b. 

At one time it was popular to distinguish between the function and 
the relation (which was called the graph of the function). Current set- 
theoretic usage takes a function to be the same thing as its graph. But 
we still have the two ways of looking at the function. 

*Theorem 33H The following three conditions on a function 
/ : N* -» N are equivalent: 

(a) / is computable. 

(b) When viewed as a relation, / is a decidable relation. 

(c) When viewed as a relation, / is an effectively enumerable 
relation. 

Proof, (a) =£> (b): Assume that / is computable; we will de¬ 
scribe the decision procedure. Given {a\,... ,a k ,b), first com¬ 
pute f{a \,..., a k ). Then look to see if the result is equal to b. If 
it is say “yes,” otherwise say “no.” 

(b) => (c): Any decidable relation is effectively enumerable. 
For we can enumerate the set of all (k + l)-tuples of numbers, and 
place on the output list those which meet the test of belonging to 
the relation. 

(c) => (a): Assume that we have an effective enumeration of 

(the graph of) /. To compute /(cq,..., a k ) we examine the 
(k + l)-tuples in the enumeration until we find the one that begins 
with a\... ,a k . Its last component is then the desired function 
value. H 
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Thus by using Church’s thesis, we can say that / is computable iff 
/ (viewed as a relation) is recursive. The class of recursive functions is 
an interesting class even apart from its connection with incompleteness 
theorems of logic. It represents an upper bound to the class of functions 
that can actually be computed by programs for digital computers. The 
recursive functions are those which are calculable by digital comput¬ 
ers, provided one ignores practical limitations on computing time and 
memory space. 

We can now describe our plans for this section and the next. Our 
basic goal is to obtain the theorems of Section 3.5. But some ground¬ 
work is required before we can prove those theorems; we must ver¬ 
ify that a number of relations (intuitively decidable) and a number 
of functions (intuitively computable) are representable in Cn A E and 
hence are recursive. In the process we will show (Theorem 34A) that 
recursiveness is equivalent to representability in Cn A E . In the remain¬ 
der of the present section we will establish general facts about rep¬ 
resentability, and will show, for example, that certain functions for 
encoding finite sequences of numbers into single numbers are repre¬ 
sentable. Then in Section 3.4 we apply these results to particular re¬ 
lations and functions related to the syntactical features of the formal 
language. 

The author is sufficiently realistic to know that many readers will be 
more interested in the theorems of Section 3.5 than in the preliminary 
spadework. If the reader is willing to believe that intuitively decidable re¬ 
lations are all representable in Cn Ae, and intuitively computable func¬ 
tions are functionally representable (a concept we will define shortly) 
there, then most if not all of the proofs in this spadework become un¬ 
necessary. But it is hoped that the definitions and the statements of the 
results will still receive some attention. 

Numeralwise Determined Formulas 

Theorem 33E tells us that we can show a relation to be representable 
in Cn Ae by finding a formula that defines it in 91 and is numeralwise 
determined by A E . The next theorem will be useful in establishing 
numeralwise determination. 

Theorem 331 (a) Any atomic formula is numeralwise determined 

by A e . 

(b) If <p and x[r are numeralwise determined by A E , then so are 
-i ip and ip —> \fr. 

(c) If <p is numeralwise determined by A E , then so are the fol¬ 
lowing formulas (obtained from ip by “bounded quantification”): 


Vi(r < y —* ip) i 
3 x(x < y A ip). 
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Proof. Part (a) follows from Theorem 33C. Part (b) is easy. It 
remains to prove part (c). We will consider a formula 

3 x(x < y A <p(x, y, z)) 

in which just the variables y and z occur free. Consider two natural 
numbers a and b; we must show that either 

A e h 3 x(x < S a 0 A <p(x, S"0, S fc 0)) 

or 

A e h 3x(x < S"0 A <p(x, S fl 0, S*0)). 

Case 1: For some c less than a, 

A e h ^(S c 0,S"0,S fc 0). (1) 

(This case occurs iff 3 x(x < S"0 A <p(x, S fl 0, S , ’0)) is true in 01) 
We also have 


A e h S c 0 < S fl 0. (2) 

And the sentences in (1) and (2) logically imply the sentence 

3x(x < S fl 0 A <p(x, S fl 0, S b 0)). 

Case 2: Otherwise for every c less than a, 

A e h -n<p(S f 0,S fl 0,S /, 0). (3) 

(This case occurs iff Vx(x < S"0 —* —> <p(x, S"0, S & 0)) is true in 
01.) We know from Lemma 33A that 

A e h Vx(x < S fl 0 -f x = S°0 V • • • V x = S fl -'0). (4) 

The sentence in (4) together with the sentences in (3) (for 
c = 0, ..., a — 1) logically imply 

Vx(x < S fl 0 <p( Xl S fl 0, S fc 0)). 

And this is equivalent to 

-i 3 x(x < S a 0 A <p(x, S a 0, S b 0 )). 

This shows that 3 x(x < y A <p(x, y, z)) is numeralwise deter¬ 
mined by A e . By applying this result to -i <p we obtain the fact 
that the dual formula, Vx(x < y —> (p{x, y, z)), is numeralwise 
determined by A e as well. H 

The argument in case 2 relied on the fact that the x quantifier was 
bounded by S°(). We will see that it is possible for 

tA(S°0), tA(S‘0), ... 
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all to be consequences of A E without having 

Vi -i i p{x) 


be a consequence. 

The preceding theorem is a useful tool for showing many relations 
to be representable in Cn A E . For example, the set of primes is repre¬ 
sented by 

S'O < Vi A Vi(i < V\ —> Vy(y <v\ —> x • y ^ t>i)). 

This formula defines the primes in Tl, and by the preceding theorem is 
numeralwise determined by A E . It therefore represents the set of primes 
in Cn A E . 

Representable Functions 

Often it is more convenient to work with functions than with relations. 
Let f : N m —»• N be an m -place function on the natural numbers. A for¬ 
mula (p in which only v\,..., v m+ 1 occur free will be said to functionally 
represent f (in the theory Cn A E ) iff for every a\,... ,a m in N, 

A e h V v m+l [<p(S fll 0,..., S fl "'0, v m+1 ) <- v m+1 = .^O]. 

(Observe that the “«—” half of this sentence is equivalent to <^(S" 0,..., 
S"" 0, S-f (ai ’-’ am) 0). The +” half adds an assertion of uniqueness.) 

Theorem 33J If <p functionally represents / in Cn A E , then it also 
represents / (as a relation) in Cn A E . 

Proof, with m — 1 . Since <p functionally represents /, we have 
for any b: 

A e F <p(S fl 0, S A 0) S /J 0 = S /(fl) 0. 

If (a, b) e /, i.e., if f(a) = b, then the right half of this bicondi¬ 
tional is valid and we get 

A e h <p(S fl 0, S*0). 

But otherwise the right half is refutable from A E (i.e., its negation 
is deducible), whence 

A e \ -' <p(S"0, S fc 0). _l 

The converse of this theorem is false. But we can change the formula: 

Theorem 33K Let / be a function on N that is (as a relation) repre¬ 
sentable in Cn A E ■ Then we can find a formula <p that functionally 
represents / in Cn A E . 
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Proof. To simplify the notation we will take / to be a one-place 
function on N. The desired sentence, 

V u 2 [<p(S fl 0, v 2 ) «-► v 2 = S /(fl) 0], 
is equivalent to the conjunction of the two sentences 

<p(S fl 0, S^ (fl) 0) (1) 

and 

V v 2 [<p(S fl 0, v 2 ) ->v 2 = S /(fl) 0]. (2) 

The sentence (1) is a theorem of A E whenever <p represents /. 
The sentence (2) is an assertion of uniqueness; we must construct 
(p in such a way that this will also be a theorem of A E . 

Begin with a formula 6 known to represent / (as a binary 
relation). Let (p be 

0(v i, v 2 ) A V z(z < v 2 —* -*0(vi,z)). 

We can then rewrite (2) as 

V u 2 [d(S"0, v 2 ) A V z(z < v 2 -► -.0(1S fl 0, z)) 

—> v 2 —S f(a> 0]. (2 ') 

To show this to be a theorem of A y it clearly suffices to show that 
A e U {<9(S fl 0, v 2 ), Vz( z <v 2 -+^ 0{ S"0, z))} = S /(fl) 0. 

Call this set of hypotheses (to the left of “b”) T. Since L3 e A 2 
it suffices to show that 

Th» 2 < S /(a) 0 (3) 

and 

r b s /(a) o < v 2 . (4) 

It is easy to obtain (4), since from the last member of T we get 
S^ <fl, 0 < v 2 —+ -i 0(S fl O, S /(a) 0) 
and we know that 

A e b d(S"0, S /(fl) 0). (5) 

To obtain (3) we first note that we have as theorems of A E , 

v 2 < ^ v 2 = S°0 V • • • V v 2 = S-^-'O (6) 

and 

- 1 0( S fl 0, S A 0) for b — 0 . f{a) - 1. (7) 

The formulas (6) and (7) imply the formula 

v 2 <S /(fl) 0-*-.<9(S fl 0, v 2 ). (8) 

Since d(S°0, v 2 ) e T, we have (3). 



214 


A Mathematical Introduction to Logic 


This shows (2) to be a theorem of A E \ (5) and (8) show (1) to 
be a theorem of Ae as well. H 

We next want to show that certain basic functions are representable 
(in Cn A e ) and that the class of representable functions has certain clo¬ 
sure properties. Henceforth in this section, when we say that a function 
or relation is representable, we will mean that it is representable in the 
theory Cn Ae- But the phrase “in Cn Ae'’ will usually be omitted. 

In simple cases, an m -place function might be represented by an 
equation 

Vm+l — I ■ 

In fact, any such equation, when the variables in t are among v\,..., v m , 
defines in (R an //(-place function /. (The value of / at (ci \,..., a m ) 
is the number assigned in '31 to t when Vj is assigned a,, I < i < in.) 
Furthermore, we know that any equation is numeralwise determined by 
Ae , so the equation represents / as a relation. In fact, it even functionally 
represents /, for the sentence 

V v m+ i[v m+ i = r(S fll 0,..., S' 3 '"0) <- v m+l = S /(ai -- a - ) 0] 
is logically equivalent to 

f(S fll 0, ...,S fl '"0) = S /(ai .°" ) 0, 

which is a quantifier-free sentence true in (R. (Here t (u i,..., u m ) is the 
term obtained by replacing V\ by u \ , then vn by un, etc.) For example: 

1. The successor function is represented (functionally) by the 
equation 

V2 — Swi. 

2. Any constant function is representable. The m -place function that 
constantly assumes the value b is represented by the equation 

v m+l - S*0. 

3. The projection function (where 1 </</«) 

Ij {(l 1 . tiff /1 = d! 

is represented by the equation 

— Vi . 

4. Addition, multiplication, and exponentiation are represented by 
the equations 

v 3 — Vi + v 2 , 
t> 3 = Vi • v 2 , 

V 2 — «iE»2, 

respectively. 
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The reader should not be misled by these simple examples; not every 
representable function is representable by an equation. 

We next want to show that the family of representable functions is 
closed under composition. To simplify the notation, we will consider a 
one-place function / on N, where 

f{a) = g(hi(a), h 2 (a)). 

Suppose that g is functionally represented by i {r and /; ,■ by 0,. To represent 
/ it would be reasonable to try either 

Vyi Vy 2 (0i(vi, yi) —► 0 2 (v i, y 2 ) —► ir(y i, yi, w)) 
or 

3 yi3y 2 (0i(vi, y x ) A 0 2 (v u y 2 ) A f(y x , yi, v 2 )). 

(Think of i/s (vi, y 2 , v 2 ) as saying “g(yi, y 2 ) = Vi” and think of 
0i(V \, yi) as saying “hi{v i) = Then the first formula translates, 
“For any y u y 2 , if h\(v\) = >’i and h 2 (v x ) = y 2 , then g(yi, y 2 ) — v 2 .” 
The second formula translates, “There exist yi, y 2 such that h \ (v\ ) = y x 
and h 2 (v x ) — y 2 and g(yi, y 2 ) — v 2 .” Either one is a reasonable way 
of saying, “g(h\ (v \), h 2 (v 0) = v 2 .” There are two choices, because 


when something is unique, either quantifier can be used for it.) 
Actually either formula would work; let (p be 

Vyi Vy 2 (0i(v u yi) —*- 0 2 (v x , y 2 ) —» f(y x , y 2 , v 2 )). 

Consider any natural number a; we have at our disposal 

V «4[iKS* l(o) 0, S* 2(a) 0, v 2 ) «-► v 2 = S /(fl) 0], (1) 

V yi(S^'O, yi) yi = S /!l(fl) 0]. (2) 

Vy 2 [d2(S fl 0,y 2 )^y 2 = S /!2(fl, 0]. (3) 

And we want 

V u 2 (<p(S fl 0, v 2 ) <^v 2 = S /(a) 0), (4) 

i.e., 

V v 2 ( Vyj V y 2 [d 1 (S fl 0, y x ) —* 0 2 (S fl O, y 2 ) —> f{y x , yi, vr>] 

<-*v 2 = S /(a) 0). (4) 


But (1), (2), and (3) imply (4), as the reader is asked to verify in 
Exercise 4. 

More generally we have 

Theorem 33L Let g be an 77 -place function, let h \,..., h„ be 777- 
place functions, and let / be defined by 

/(«i, • • ■, a m ) = g (h \ (ci 1 ,... ,a m ),h„{a x -- a m )). 
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From formulas functionally representing g and h \, ..., h n we can 
find a formula that functionally represents /. 

In the above proof we have m — 1 and n = 2. But the general case 
is proved in exactly the same way. 

In order to obtain a function such as 

f{a, b) — g(h(a), b ), 


we note that 

f(a,b) = g(h(tf(a, bj),ll(a, b)). 

The above theorem then can be applied (twice) to show that / is repre¬ 
sentable (provided that g and h are). 

To facilitate discussion of functions with an arbitrary number of 
variables, we will use vector notation. For example, the equation in the 
above theorem can be written 

f(a) = g(hi(a),h„ (3)). 

Another important closure property of the functions representable in 
Cn Ae is closure under the "least-zero” operator. 

Theorem 33M Assume that the (m + l)-place function g is repre¬ 
sentable and that for every a i, ..., a m there is a b such that 

g(a l, ...,a m ,b) = 0. 

Then we can find a formula that represents the w-place function 
/, where 

..., a,„) — the least b such that g(a \,..., a m , b) = 0. 


(In vector notation we can rewrite this last equation: 

/ (a) — the least b such that g(a,b) = 0. 

The traditional notation for the least-zero operator is 

f&) = fibig{ a, b) = o] 

and the operator is often called "the /x-operator.”) 

Proof. To simplify the notation we take in — 1; thus 

f(a) — b iff g(a , b) = 0 and for all c < b, g(a, c ) ^ 0. 

If t/t represents g. then we can obtain a formula representing / (as 
a relation) simply by formalizing the right side of this equivalence: 

f{vi, v 2 , 0) A V y(y < v 2 ir(v u y, 0)). 



Chapter 3: Undecidability 


217 


This formula defines (the graph of) f and is numeralwise deter¬ 
mined by Ae- H 

A Catalog 

We now construct a repertoire of representable (in Cn A E ) functions and 
relations, including in particular functions for encoding and decoding 
sequences. 

0. As a consequence of Theorem 331, any relation that has (in Tl) a 
quantifier-free definition is representable . And the class of representable 
relations is closed under unions, intersections, and complements. And 
if R is representable, then so are 

{(fli,..., a m , b) | for all c < b, (a i ,... ,a m ,c) e R } 

and 

{(fli,..., a m , b) | for some c < b, (a\, ..., a m , c) e R}. 

For example, any finite relation has a quantifier-free definition, as does 
the ordering relation. 

1. A relation R is representable iff its characteristic function Kr is. 
(Kr is the function for which Kr(o) — 1 when a e R, and Kr(o) = 0 
otherwise.) 

Proof. (4=) Say that R is a unary relation (a subset of N) and 
that Kr is represented by ijr(v t, Vi). We claim that SO) 

represents R. For it defines R and is numeralwise determined 
by A e . 

(=►) Say that cp(v i) represents R. Then 

(<p(v i) A v 2 — SO) V (-i <p(v i) Ath = 0) 

represents (the graph of) K R , for the same reason as in the last 
paragraph. (Actually this formula even functionally represents 
Kr, as the reader can verify.) H 

2. If R is a representable binary relation and /, g are representable 
functions, then 

{a | (f(a), g(a)) e Rj 

is representable. Similarly for an m -ary relation R and functions 

/l- .... /rn- 

PROOF. Its characteristic function at ci has the value KR(f(a), 
g(a)). Thus it is obtained from representable functions by com¬ 
position. H 

For example, suppose that R is a representable ternary relation. Then 

{(*, y) I ( y,x,x) e R) 
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is representable, being 

{(x,y)\(ll(x,y)jlix,y), I[(,x,y)) e R}. 

In this way we can rearrange and repeat variables in describing a repre¬ 
sentable relation. 

3. If R is a representable binary relation, then so is 

P = {{a, b) | for some c < b, (a, c) e R}. 

Proof. We have from catalog item 0 that if 

Q — {(a, b) | for some c < b, (a, c) e R], 
then Q is representable. And 

(a, b) e P (a, S(b)) e Q 

<*• ( I?(a,b),S(lZ(a,b ))> e Q. 

Hence by catalog item 2, P is representable. H 

More generally, if R is a representable ( m + l)-ary relation, then 
{(<3i,..., a m , b) | for some c < b, {a \,..., a m , c) e R} 
is also representable. In vector notation this relation becomes 
{(a, b) | for some c < b, (a, c) e R}. 

Similarly 

{(a, b) | for all c < b, (2, c) e 7?} 

is representable. 

4. The divisibility relation 

{{a, b) | a divides b in N} 

is representable. 

Proof. We have a divides b iff for some q < b,a-q — b. We know 
that{(a, b, q) \ a-q — b] is representable, as it has a quantifier-free 
definition. Upon applying the above items, we get the divisibility 
relation. (In yet further detail, from catalog item 3 we get the 
representability of 

R = {(a, b, c) | for some q < c, a ■ q — b] 
and a divides b iff (a, b, b) e R.) H 

5. The set of primes is representable, 

6. The set of pairs of adjacent primes is representable. 

Proof, (a, b) is a pair of adjacent primes iff a is prime and b is 
prime and a < b and there does not exist any c < b such that 
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a < c and c is prime. The right side of this equivalence is easily 
formalized by a numeralwise determined formula. H 

Note (for future use in Section 3.8) that we have not yet used the fact 
that exponentiation is representable. 

Observe that as this catalog progresses, we are in effect building up 
a “language” C such that anything (any relation, any function) that is 
^-definable (in 9T) will be certain to be representable in our theory. 
Thus, Theorem 331 tells us that (a) atomic formulas are allowed in C, 
(b) all sentential connectives are permitted, and (c) bounded quantifiers 
can be used. (Unbounded quantifiers are not in general allowed.) Then 
our catalog gradually adds particular predicate symbols and function 
symbols; catalog item 6 adds a two-place predicate symbol for “adja¬ 
cent primality”; and catalog item 7 will add a function symbol for the 
prime-listing function. Theorem 33L justifies using these function sym¬ 
bols inside expressions of C. 

7. The function whose value at a is p a , the (a + l)st prime, is 
representable. (Thus po = 2, pi = 3, pi — 5, p$ = 7, p 4 =11, and so 
forth.) 

2 

Proof. p a — b iff b is prime and there exists some c < b a , such 
that (i)-(iii) hold: 

(i) 2 does not divide c. 

(ii) For any q < b and any r < /?, if (q, r) is a pair of adjacent 
primes, then for all j < c, 

qi divides c r J+1 divides c. 

(iii) b a divides c and b a+l does not. 

This equivalence is not obvious, but at least the relation defined 
by the right-hand side is representable. To verify the equivalence, 
first note that if p a — b, then we can take 

oO o 1 c2 ci 

c = 2 -3 -5 p a . 

It is easy to check that this value of c meets all the conditions. 
Conversely, suppose c is a number meeting conditions (i)-(iii). 
We claim that c must be 

2° • 3 1 • ... • b a ■ powers of larger primes. 

Certainly the exponent of 2 in c is 0, by (i). We can use (ii) to 
work our way across to the prime b. But by (iii) the exponent of 
b is a, so b must be the (a + l)st prime, p a . H 



220 


A Mathematical Introduction to Logic 


This function will be very useful in encoding finite sequences of 
numbers into single numbers. Let 

(a 0 , a m ) = p a 0 o+l . p%‘ +1 

= n p? +1 . 

i<m 

This holds also for m — —1; we define ( ) = 1. For example, 

(2, 1) = 2 3 • 3 2 = 72. 

The idea is that 72 safely encodes the pair (2, 1). 

There are other ways to encode pairs of numbers and finite sequences 
of numbers. In Section 3.8, we will make use of a pairing function 

1 9 

J(a, b) = -[(a + by + 3 a + b] 

that has the advantage of growing at a polynomial rate, unlike the growth 
rate of 2" +l 3 / ’ : 1 . Here is a very different way to encode, for example, the 
numbers 24, 117, 11 (in that order). First we convert to numerals in base 
9: 26, 140, 12. Secondly, we concatenate these numerals, separated by 
9’s: 269140912. The triple is encoded by the number thereby designated 
(in base 10), that is, 269,140,912. This method may seem tricky, but it 
produces a result that is much smaller than 2 25 3 118 5 12 , which requires 
73 digits in base 10. 

8. For each m, the function whose value at ao, ..., a, n is (oq, ..., a m ) 
is representable. 

9. There is a representable function (whose value at (a, b) is written 
( a)b ) such that for b < m, 

((tt0> • • • > = Ub- 

(This is our “decoding” function. For example, (72)o = 2 and (72) i = 1.) 

Proof. We define (a)b to be the least n such that either a = 0 or 
pl +1 does not divide a. (There always is such an n.) Observe that 
(0)/, = 0, and for a ^ 0, (a)b is one less than the exponent of 
pi, in the prime factorization of a (but not less than 0). Hence for 
b < m, 

( , . . . , tLi})/? — ■ 

To prove representability we use the least-zero operator. Let 

R = {(«, b, n) | a = 0 or p'^ 2 does not divide a}. 

Then ( a)b — pn[Kj(a, b , n) — 0], where R is the complement 
of R. H 

Since the method used in the above proof will be useful elsewhere 
as well, we here state it separately: 
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Theorem 33N Assume that R is a representable relation such that 
for every a there is some n such that (a, n) e R. Then the function 
/ defined by 

f(a) = the least n such that (a, n) e R 
is representable. 

Proof. f(a) = pn[Kg(a, n) = 0]. H 

We will later use the notation 

f(a) — pn[(a , n) e R]. 

10. Say that b is a sequence number iff for some m > — 1 and some 
a 0 ,.. •, a m , 

b - (ao ,..., a„). 

(When m — — 1 we get ( ) = 1.) Then the set of sequence numbers is 
representable. 

Proof. Exercise 5. H 

11. There is a representable function lh such that 

lh(ao, • • •, a m ) = m + 1. 

(Here “lh” stands for “length.” For example, lh 72 = 2.) 

Proof. We define lh a to be the least n such that either a — 0 or 
p n does not divide a. This works. ~\ 

12. There is a representable function (whose value at (a, b) is called 
the restriction of a to b, written a \ b) such that for any b < m + 1, 

(a 0 ,...,a m ) T b = (ao, a fo _i). 

Proof. Let a f b be the least n such that either a = 0 or both n ^ 0 
and for any j < b, any k < a 

p k j divides a p k divides n . 

This works. H 

13. (Primitive recursion) With a {k+ l)-place function / we asso¬ 

ciate another function / such that f (a, b\, ..., bk) encodes the values 
of f(j,b\ -- bk) for all j < a. Specifically, let 

f(a,b)= </(0, /(a-!,*)>. 

For example, f(0,b) — ( ) = 1, encoding the first zero values of 
/. /(1, b ) = (/(0, b)). In any case, f{a, b) is a sequence number of 
length a, encoding the first a values of f . 
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Now suppose we are given a (k + 2)-place function g. There exists 
a unique function / satisfying 

fia, b) = g{f {a, b), a , b). 

For example, 

mb) = g ((),o, h, 

fihb) = gi{f(.0,b)),l,b). 

(The existence and uniqueness of this / should be intuitively clear. For 
a proof, we can apply the recursion theorem of Section 1.4, obtaining 
first / and then extracting /.) 

Theorem 33P Let g be a (k + 2)-place function and let / be the 
unique (k + l)-place function such that for all a and (^-tuples) b, 

f(a , b) = gif (a, b), a , b ). 

If g is representable, then so is /. 

Proof. First we claim that / is representable. 

This follows from the fact that 
f (a , b) — the least s such that .v is a sequence number of 

length a and for i less than a, is)i = gis f i, i, b). 

It then follows that / is representable, since 
fia, b) = gifia, b ), a, b) 

and the functions on the right are representable. H 

Actually the phrase "primitive recursion” is more commonly applied 
to a simpler version of this, given in Exercise 8. 

14. For a representable function F, the function whose value at 
a, b is 

n fh, i) 

i<a 

is also representable. Similarly with E in place of n. (For a — 0, we 
use the standard conventions: The empty product — the product of no 
numbers — is 1, and the empty sum is 0.) 

Proof. Call this function G; then 

G(0, b) = 1, 

G(a + 1 ,b) = Fia, b) ■ Gia, b). 

Apply Exercise 8. H 

15. Define the concatenation of a and b, a * b, by 

7 TT d>)i + l 

a*b=a■ [[Pi+ma- 
i < \hb 
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This is a representable function of a and b, and 

(fli, ..., dm) * (b 1 ,..., b„) = (a 1 ,.. .,a m ,bu b„). 

The concatenation operation has the further property of being associa¬ 
tive on sequence numbers. 

16. We will also want a “capital asterisk” operation. Let 
*,<«/(O = /(0)*/(l)*---*/(a —1). 

For a representable function F, the function whose value at a, h is 
*i <a F(i, b ) is representable. 

Proof. *,<oF(/, b) = {) = 1 and 

*i<a+\F(i, b) = * i<a F(i, b) * F(a, b). 

So this is just like catalog item 14. H 


Exercises 

1. Show that in the structure (N; ■, E) we can define the addition re¬ 
lation {(/«, n, m + n) \ m , n in N). Conclude that in this structure 
{0}, the ordering relation <, and the successor relation {(n, S(n)) \ 
n e N | are definable. ( Remark : This result can be strengthened by 
replacing the structure (N; •, E) by simply (N; E). The multipli¬ 
cation relation is definable here, by exploiting one of the laws of 
exponents: ( d a ) b = d ah .) 

2. Prove Theorem 33C, stating that true (in 93) quantifier-free sen¬ 
tences are theorems of A E . (See the outline given there.) 

3. A theory T (in a language with 0 and S) is called a ^-complete iff 
for any formula cp and variable x , if belongs to T for every 
natural number n, then \/xcp belongs to T. Show that if 7’ is a 
consistent cu-complete theory in the language of 91 and if Ae C T, 
then T = Th9t. 

4. Show that in the proof preceding Theorem 33L, formula (4) is 
logically implied by the set consisting of formulas (1), (2), and (3). 

5. Show that the set of sequence numbers is representable (catalog 
item 10). 

6. Is 3 a sequence number? What is lh 3? Find (T * 3) * 6 and 1 * (3 * 6). 

7. Establish the following facts: 

(a) a + 1 < p a . 

(b) {b)^ < b\ equality holds iff b = 0. 

(c) lh a < a\ equality holds iff <7 = 0. 

(d) a | i < a. 

(e) lh(a \ i) is the smaller of i and lh a. 
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8 . Let g and h be representable functions, and assume that 


m b ) = g (b), 

fia + !,£>) = h(f{a , b), a, b). 


Show that / is representable. 

9. Show that there is a representable function / such that for every 

Yl , Uq ^ ... ^ Clfi , 


f({a 0 ,...,a„)) =a n . 


(For example, /(72) = 1 and /(750) = 2.) 

10. Assume that R is a representable relation and that g and h are 
representable functions. Show that / is representable, where 


f&) = 


g(a) if a e R, 
h(ci) if a ^ R. 


11. (Monotone recursion) Assume that R is a representable binary re¬ 
lation on N. Let C be the smallest subset of N (i.e., the intersection 
of all subsets) such that for all n, ao ,..., a„_i, b. 


({ao, ..., a n - 1 ), b) e R & cij e C (for all i < n) =>■ b e C. 
Further assume that (1) for all n, ao, ..., a„_i, 


{{ao, ■ ■ ■, a„- 1 ), b) e R =>■ a,- < b (for all i < n), 

and ( 2 ) there is a representable function / such that for all n, 
ao, ..., a H _i, b. 


{{a Q -- o„_i), b) e R =$ n < f(b) 

Show that C is representable. (C is, in a sense, generated by R. 
C 7 ^ 0 in general because if (( ), b) e R, then b e C.) 


SECTION 3.4 _ 

Arithmetization of Syntax 

In this section we intend to develop two themes: 

1. Certain assertions about wffs can be converted into assertions 
about natural numbers (by assigning numbers to expressions). 

2. These (English) assertions about natural numbers can in many 
cases be translated into the formal language. And the theory Cn Ae is 
strong enough to prove many of the translations so obtained. 

This will give us the ability to construct formulas that, by expressing 
facts about numbers, indirectly express facts about formulas (even about 
themselves!). Such an ability will be exploited in Section 3.5 to obtain 
results of undefinability and undecidability. 
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Godel Numbers 

We first want to assign numbers to expressions of the formal language. 
Recall that the symbols of our language are those listed in Table IX. 


TABLE IX 


Parameters 

Logical symbols 

0. V 

1 - ( 

2 . 0 

3. ) 

4. S 

5. -i 

6 . < 

7. -*• 

8 . + 

9. = 

10 . • 

11 . 

12. E 

13. v 2 , etc. 


There is a function h assigning to each symbol the integer listed to 
its left. Thus h (V) = 0, /;(()) = 2, and h ( v ,) = 9 + 2 i. In order to make 
our subsequent work more widely applicable, we will assume only that 
we have some language with 0 and S which is recursively numbered. By 
this we mean that we have a one-to-one function h from the parameters 
of that language into the even numbers such that the two relations 

{(k, m) | k is the value of h at some m-place predicate parameter} 
and 


{(k, m) | k is the value of h at some m-place function symbol} 

are both representable in Cn Ae- Of course in the case of the lan¬ 
guage of 91 these sets are even finite. The first set is {( 6 , 2}} and the 
second is 


{(2,0), (4,1), ( 8 , 2), (10, 2 >, ( 12 , 2 >}. 

We define h on the logical symbols as before; thus h (,v) is an odd number 
for each logical symbol .v. 

For an expression e — so ■ ■ ■ s n of the language we define its Godel 
number, J 1 (e), by 

ttGo • • • s„) = ( h(s 0 ), ..., h(s n )). 

For example, using our original function h for the language of 91, we 
obtain 

tt(3 v 3 v 3 = 0) 

= tf((-Vt> 3 (-i=tbO))) 

= (1,5,0, 15, 1,5,9, 15,2,3,3) 

= 2 2 • 3 6 • 5 1 • 7 16 • ll 2 • 13 6 • 17 10 • 19 16 • 23 3 • 29 4 ■ 31 4 . 
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This is a large number, being of the order of 1.3 x 10 7 \ To a set <t> 
of expressions we assign the set 

tJd> = {#(«) I s e O} 

of Godel numbers. 

To a sequence (ao. ■.., cn,,} of expressions (such as a deduction), we 
assign the number 

<?«a 0 , ■ • •, a„)) = (tta 0 , • • •, 

We now proceed to show that various relations and functions having 
to do with Godel numbers are representable in Cn Ae (and hence are 
recursive). As in the preceding section, whenever we say that a relation 
or function is representable (without specifying a theory) we mean that 
it is representable in the theory Cn Ae- 

We will make use of certain abbreviations in the language we use (i.e., 
English, although it is coming to differ more and more from what one 
ordinarily thinks of as English). For “there is a number a such that” we 
write “3a.” In the same spirit, “3a, b < c” means “there are numbers 
a and b both of which are less than c such that.” Similarly, we may 
employ “V.” We would not have dared to employ such abbreviations in 
Chapter 2, for fear of creating confusion between the formal language 
and the meta-language (English). But by now we trust the reader to 
avoid such erroneous ways. 

1. The set of Godel numbers of variables is representable. 

Proof. It is {a | (3£> < a)a = (11 + 2b)}. It follows from results 
of the preceding section that this is a representable set. H 

2. The set of Godel numbers of terms is representable. 

Proof. The set of terms was defined inductively. And terms were 
built up from constituents with smaller Godel numbers. We will 
treat this case in some detail, since it is typical of the argument 
used for inductively defined relations. 

Let / be the characteristic function of the set of Godel numbers 
of terms. From the definition of “term” we obtain 

1 if a is the Godel number of a variable, 

1 if (3 i < □, 3k < a) [; is a sequence number 

& (Vj < lh i)f(ii)j) = 1 & A: is the value of 
h at some (lh i )-place function symbol & 

« = (k) * *y<lhi(i)y]> 

0 otherwise. 

But what upper bound for i can we use in place of that 
symbol? Before we can argue that / is representable, we will 
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need an upper bound on / that depends in some representable 
way on a. 

The claim is that we can take i < a" lh ". To see this, suppose 
that a = Jtsfi • • • t n (where s is an /r-place function symbol and 
t\,... ,t n are terms). Then we want to take i — (ff|,..., JJ t n ). 
How big could this be, in terms of a? We have the bounds: 


i = 2» fl+1 ■ • • pl'lV 


< 2 a • • • PU 

< 2" • • • P\ ba _\ because n = Ih i < lha 

< a a ■ ■ ■ a a (lh a times) because a — 2 (fl)o+1 
= (a a ) xba = ci aVaa 


Olho-l + l 
Plha-1 — Plha- 


So in the above equation for /, we replace □ by ci alba . 

Although the right side of this equation refers to /, it refers 
only to f where (i)j < a. This feature permits us to apply 
primitive recursion. f(a) = g( f(a), a), where 


1 if a is the Godel number of a variable, 

1 if (3/ < ci aVaa , Bk < a) [i is a sequence number 
& (Vj < lhi)(s)(,-). = 1 & A: is the value of 
h at some (lh / (-place function symbol & 
a = (k) * *;<ihi(Oy'L 
0 otherwise. 


For if in this equation we set 5 equal to f (a), then (s)(,-). = 

for (i ) j < a. Hence by Theorem 33P, / is representable provided 

that g is. 

It remains to show that g is representable. But this is straight¬ 
forward, by using results of the preceding section. Briefly, the 
graph of g is the union of three relations, corresponding to the 
three clauses in the above equation. Each of the three is obtained 
from equality and other representable relations by bounded quan¬ 
tification and the substitution of representable functions. H 

3. The set of Godel numbers of atomic formulas is representable. 

Proof, a is the Godel number of an atomic formula iff (3/ < 
a alha , 3 k < a) [/ is a sequence number & (V j < lh/)(/) 7 - is the 
Godel number of a term & A is the value of h at some (lh / )-place 
predicate symbol & a — (k) * *y<ih;(/)_/]- H 


4. The set of Godel numbers of wffs is representable. 
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Proof. The wffs were inductively defined. Let / be the character¬ 
istic function of the set, then 

if a is the Godel number of an atomic formula, 
if (3i <a)[a= (h((), h(->)) * i * (h())) 

& fd) = 1], 

if (3 i, j < a)[a = (/?(()) * i * (/*(—►)) * j * (/?())) 

& /(«) = /O') = 1], 

if (3i, j < a)[a = (h(V)) * i * j & i is the Godel 
number of a variable and f(J) = 1], 
otherwise. 

By the same argument used for the set of Godel numbers of terms, 
we have the representability of /. H 

5. There is a representable function Sb such that for a term or formula 
a , variable x, and term t, 

Sb(fja, tlx, | it) = tJa*. 

Proof. We will need to define Sb(a, /;. c) making use of values 
Sb(f, /;, c) where i < a. As in the case of catalog item 2 (the char¬ 
acteristic function of the set of terms), it will then be possible to 
show that both Sb and Sb are representable. 

The function Sb is described by the following six clauses 
(i)-(vi): 

(i) If a is the Godel number of a variable and a — b then 

Sb(< 7 , b, c) = c. 

(ii) If (3/ < ci aVaa , Bk < a)[i is a sequence number & (V/ < 
lhi)(*)y is the Godel number of a term & k is the value of h at some 
(lh ;)-place function or predicate symbol & a = (k) * * 7 <ih/ 0 )y] 
then 


f(a) = < 


n 

i 

i 

i 

0 


Sb(a, b, c) = (k) * * 7 -<ihiSb((i);, b, c ) 
for that i and k. 

(hi) If Of <a)[i is the Godel number of a wff & a = 
(h ((), h(— i)) * i * (h()))] then 

Sb(a, b, c) = (Ji ((), h(- 0) * Sb(f, b, c) * 

for that i . 

(iv) If (3 i, j < a)[i and j are Godel numbers of wffs & a — 
(h( 0 ) * i * (h( — y)) * j * (hQ))] then 

Sb(a, b, c) = (/?(()) * Sb(i, b, c) * {h(-y)) * Sb (j, b, c ) * </r(») 
for that i and j. 
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(v) If (3/, j < a)[i is the Godel number of a variable & i b 
& j is the Godel number of a wff & a = (/?(V)) * / * j] then 

Sb(a, b, c ) = (/r(V)) * i * Sb(y, b , c) 

for that i and j. 

(vi) If none of the above conditions on a and b are met (where 
we ignore the displayed equation for Sb(<7, b. c)) then 

Sb(<7, b, c) = a. 

Then the function Sb is obtained by primitive recursion 

Sb(a, b, c) — G(Sb(a, b, c), a, b, c) 

where G is a 4-place function. The graph of G is the union of six 
5-ary relations 

G — R\ U R 2 U R 3 U R 4 U R 5 U Rfr 

corresponding to the six clauses above. 

The first of the six is 

R\ = {(s, a, b, c, d) a is the Godel number of a variable & 
a = b & d = c}. 

The second one is 

R 2 = {(5, a, b, c, d) | (3/ < a alha , 3 k < a)[i is a sequence number 
& (V j < lh i)(i)j is the Godel number of a term & k is the 
value of h at some (lh/)-place function or predicate symbol & 
a = (k) * *;<ih,■(/),' & d = (k ) * *j <m (i)(Oj]} 

and the others are similar translations of the corresponding clauses 
in the description of Sb. 

It is necessary to note that G is indeed a function; it is single¬ 
valued. This is because no two clauses could apply to one number 
a. And if, for example, clause (ii) applies to a, then we know from 
Section 2.3 that the numbers / and k are uniquely determined. 

Finally, we apply the usual methods to verify that R\-Re are 
representable, so G is representable, so Sb is representable, so Sb 
is representable. (Substitution is a complicated operation!) H 

6. The function whose value at n is J}(S"0) is representable. 

Proof. Call this function /; then 

/(0) = <A(0)>, 

fin + 1) = <A(S)> * f{n). 

Apply Exercise 8 of the preceding section. H 

7. There is a representable relation Fr such that for a term or formula 
a and a variable x, 

(|j«, X-x) £ Fr <£> x occurs free in a. 



230 


A Mathematical Introduction to Logic 


Proof. (a,b) e Fr Sb(a, b, ()0) ^ a. H 

8. The set of Godel numbers of sentences is representable. 

Proof, a is the Godel number of a sentence iff a is the Godel 
number of a formula and for any b < a, if b is the Godel number 
of a variable then (a. b) £ Fr. H 

9. There is a representable relation Sbl such that for a formula a , 
variable x, and term t, (fta, fjx, Jlf} e Sbl iff f is substitutable for x in a. 

Proof. Exercise 1. H 

10. The relation Gen, where (a, b) e Gen iff a is the Godel number of 
a formula and b is the Godel number of a generalization of that formula, 
is representable. 

Proof, (a, b) e Gen iff a = b or (3i, j < b)[i is the Godel number 
of a variable and (a, j) e Gen and b — ((/r(V)) * i * j]. Apply 
the usual argument to the characteristic function of Gen. H 

11. The set of Godel numbers of tautologies is representable. 

The set of tautologies is informally decidable since we can use the 
method of truth tables. To obtain representability, we recast truth tables 
in terms of Godel numbers. There are several preliminary steps: 

11.1 The relation R , such that (a, b) e R iff a is the Godel number 
of a formula a and b is the Godel number of a prime constituent of a, 
is representable. 

Proof, (a, b) e R <S> a is the Godel number of a formula and one 
of the following: 

(i) a = b& (a )o ^ h((). 

(ii) (3/ <a)[a = {h{0, H~>)) * i * ((H))) and (i, b) e R). 

(iii) The analogue to (ii) but with —k 

Apply the usual argument to the characteristic function of R. H 

11.2 There is a representable function P such that for a formula 

a. P(tJa) = ..., JJ b n ), the list of Godel numbers of prime con¬ 

stituents of a. in numerical order. 

Proof. First define a function g for locating the next prime con¬ 
stituent in t|fl after t|y (where \\a is the formula a for which 
a — JJa). 

g(a, y) = the least n such that either n = a + 1 or both 
y < n and (a, n) e R. 

Next define a function h such that h (a,n) gives the (n+ 1)st prime 
constituent of t ]a (if there are that many): 

h(a , 0) = g(a, 0) h(a, n + 1) = g(a , h(a, n)). 
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Finally, let P(a) — *i<k{h(a, /)> where k is the least number for 
which h{a, k) > a. H 

11.3 Say that the integer v encodes a truth assignment for a iff v is 
a sequence number and lh v — lh P(t)or) and (Vi < lh i>)(3<? < 2)(i>); = 
((P(tJa))i, e). This is a representable condition on v and JJce. 

For example, if P(jja) = ..., JJyS„), then 

v = (<BA)> e o)> • ■ <ttAi. ««)}, 

where each e, is 0 or 1. We will later need an upper bound for v in terms 
of f)a. The largest v is obtained when each e ,■ is 1. Also f/3, < [la, so 
that 

v < <<tt«, 1), • • •, (tta, 1)) 

= */<lhP(t)a)((tt« ! !))■ 

11.4 There is a representable relation Tr such that for a formula a 
and a v which encodes a truth assignment for a (or more), (tta, v) e Tr 
iff that truth assignment satisfies a. 

Proof. Exercise 2. H 

Finally, a is a tautology iff a is a formula and for every v encoding a 
truth assignment for a, (tta, v) e Tr. The (English) quantifier on v can 
be bounded by a representable function of tta, as explained in 11.3. 

12. The set of Godel numbers of formulas of the form Vx<p —► cpf, 
where t is a term substitutable for the variable x in <p, is representable. 

Proof, a is of this form iff (3 wff tp < a) (3 variable x < a) (3 term 
t < a)[t is substitutable for x in <p and a = Vi cp —* Here 
“cp < a” means that ft (p < tta. This is easily rewritten in terms of 
Godel numbers: a belongs to the set iff (3/ < a)( 3.r <a)(Bt <a) 
[f is the Godel number of a formula & x is the Godel number of 
a variable & t is the Godel number of term and (/, x , t) e Sbl & 

a = (h( (), A(V)> *x*f* <*(—)> * Sb (/, x, t) * (hO))l 

13. The set of Godel numbers of formulas of the form Vx(a —^ ft ) —► 
V x a —>• V x P is representable. 

Proof, y is of this form iff (3 variable x < y )(3 formulas 
a, f) < y) [y — Vx(a —► ft ) —* Via —* Vi fi]. This is eas¬ 
ily rewritten in terms of Godel numbers, as in 12. H 

14. The set of Godel numbers of formulas of the form a —> Via, 
where x does not occur free in a, is representable. 

Proof. Similar to 13. H 

15. The set of Godel numbers of formulas of the form x = x is 
representable. 
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Proof. Similar to 13. H 

16. The set of Godel numbers of formulas of the form x — y —►« —*«', 
where a is atomic and a' is obtained from a by replacing x at zero or 
more places by y, is representable. 

Proof. This is similar to 13, except for the relation of “partial 
substitution.” Let (a, b, x, y) e Psb iff x and y are Godel numbers 
of variables, a is the Godel number of an atomic formula, b is a 
sequence number of length lh a, and for all j < Ih a, either («) ; = 
(b)j or ( a)j — x and (b)j — y. This relation is representable. H 

17. The set of Godel numbers of logical axioms is representable. 

Proof, a is a logical axiom iff 3/3 < a such that cy is a generaliza¬ 
tion of ft and ft is in one of the sets in items 11-16. H 

18. For a finite set A of formulas, 

{G(D) I D is a deduction from A} 

is representable. In fact it is enough here for ;JA to be representable. 

Proof. A number d belongs to this set iff d is a sequence number 
of positive length and for every i less than lh d, either 

1. ((/),■ e tJA, 

2. (d)j is the Godel number of a logical axiom, or 

3. (3 j,k< i)[(d)j = (h(0)*(d) k *(h(^))*(d) i *(hO))]. 

This is representable whenever #A is, as is certainly the case 
for finite A. H 

19. Any recursive relation is representable in Cn A E . 

Proof. Recall that the relation R is recursive iff there is some finite 
consistent set A of sentences such that some formula p represents 
R in Cn A. (There is no loss of generality in assuming that the 
language has only finitely many parameters: those in the finite set 
A, those in p, and 0, S, and V.) In the case of a unary relation R , 
we have that a e R iff the least D which is a deduction from A of 
either p(S"0) or -i p (S" 0) is, in fact, a deduction of the former. 

More formally, a e R iff the last component of/(«) is LplS"0). 
where 

f (a ) = the least d such that d is in the set of item 18 and the 
last component of d is either Up(S n 0) or ft — 1 2 3 p(S fl 0). 

For this (fixed) p, there always is such a d. H 

Since the converse to item 19 is immediate, we have 

Theorem 34A A relation is recursive iff it is representable in the 
theory Cn A^. 
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Henceforth we will usually use the word “recursive” in preference 
to “representable.” 

Corollary 34 B Any recursive relation is definable in fR. 

20. Now suppose we have a set A of sentences such that JJ A is re¬ 
cursive. Then fl Cn A need not be recursive (as we will show in the next 
section). But we do have a way of defining Cn A from A: 

a e ft Cn A iff 3d\d is the number of a deduction from A 

and the last component of d is a and a is the 
Godel number of a sentence] 

The part in square brackets is recursive, by the proof to item 18. But we 
cannot in general put any bound on the number d. The best we can say 
is that Cn A is the domain of a recursive relation (or, as we will say 
later, is recursively enumerable). 

Item 20 will play a key role our subsequent work. In particular, it 
will later be restated as Theorem 351. 

21. If tJ A is recursive and Cn A is a complete theory, then JJ Cn A is 
recursive. 

In other words, a complete recursively axiomatizable theory is re¬ 
cursive. This is the analogue to Corollary 25G, which asserts that a 
complete axiomatizable theory is decidable. 

The proof is essentially unchanged. Let (in the consistent case) 

g (s ) = the least d such that s is not the Godel number of a sentence, 
or d is in the set of item 18 and the last component of d is 
either s or is (/;((), h(- 1 )) * s * (/r())). 

Thus g(tler ) is Q of the least deduction of cr or (—kt) from A. And 
s e JJ Cn A iff s > 0 and the last component of g(s) is s. H 

At this point we might reconsider the plausibility of Church’s thesis. 
Suppose that the relation R is decidable. Then there is a finite list of 
explicit instructions (a program) for the decision procedure. The pro¬ 
cedure itself will presumably consist of certain atomic steps, which 
are then performed repeatedly. (The reader familiar with computer pro¬ 
gramming will know that a short program can still require much time 
for its execution, but some commands will be utilized over and over.) 
Any one atomic step is presumably very simple. 

By devices akin to Godel numbering, we can mirror the decision 
procedure in the integers. The characteristic function of R can then be 
put in the form 
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K R (a ) = t/ [ the least s such that 

(i) (Mo encodes the input a; 

(ii) for all positive i < Ih s , (.y), is obtained from (,y)i_] by perfor¬ 
mance of the applicable atomic step; 

(iii) the last component of s describes a terminal situation, at which 
the computation is completed], 

where U (the upshot function) is some simple function that extracts 
from the last component of s the answer (affirmative or negative). The 
recursiveness of R is now reduced to the recursiveness of U and of 
the relations indicated in (i), (ii), and (iii). In special cases, such as 
decision procedures provided by the register machines of Section 3.6, 
the recursiveness of these components is easily verified. It seems most 
improbable that any decision procedure will ever be regarded as effective 
and yet will have components that are nonrecursive. For example, in (ii), 
it seems that it ought to be possible to make each atomic step extremely 
simple, and in particular to make each one recursive. 

Exercises 

1. Supply a proof for item 9 of this section. 

2. Supply a proof for item 11.4 of this section. 

3. Use Exercise 11 of Section 3.3 to give a new proof that the set of 
Godel numbers of terms is representable (item 2). 

4. Let T be a consistent recursively axiomatizable theory (in a recur¬ 
sively numbered language with 0 and S). Show that any relation 
representable in T must be recursive. 


SECTION 3.5 _ 

Incompleteness and Undecidability 

In this section we reap the rewards of our work in Sections 3.3 and 3.4. 
We have assigned Godel numbers to expressions, and we have shown 
that certain intuitively decidable relations on N (related to syntactical 
notions about expressions) are representable in Cn A E . 

Throughout this section we assume that the language in question is 
the language of (R. (This affects the meaning of “Cn” and “theory.”) 

Fixed-Point Lemma Given any formula fi in which only V \ occurs 
free, we can find a sentence a such that 

A e h [a <-*■ /f(S #cr 0)]. 

We can think of a as indirectly saying, “ft is true of me.” Ac¬ 
tually, of course, a doesn’t say anything; it’s just a string of sym¬ 
bols. And even when translated into English according to the intended 
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structure 93, it then talks of numbers and their successors and products 
and so forth. It is only by virtue of our having associated numbers with 
expressions that we can think of cr as referring to a formula, in this case 
to a itself. 

Proof. Let 9(v i, v 3 , v 3 ) functionally represent in Cn Ae a func¬ 
tion whose value at (ft a, n) is tl(a(S"0)). (See items 5 and 6 in 
Section 3.4.) First consider the formula 

Vvt,[0(v\,' v u v 3 ) -+ ft(v 3 )]. (1) 

(We may suppose v 3 is substitutable for V] in /l. The above formula 
has only V\ free. It defines in 93 a set to which ja belongs iff 
tKadS^O)) is in the set defined by ft.) Let cj be the Godel number 
of (1). Let a be 

V®3[d(S«0, S«0, v 3 ) ->• ft(v 3 )]. 

Thus cr is obtained from (1) by replacing V\ , by S'M). Notice that 
a does assert (under 93) that flcr is in the set defined by ft. But we 
must check that 

a <-► ft(S :a 0) (2) 

is a consequence of Ae . Because 0 functionally represents a func¬ 
tion whose value at (q, q) is (ter, we have 

A f hV u 3 [@(S 9 0, S 9 0, v 3 ) <-► v 3 = S #ff 0], (3) 

We can obtain (2) as follows: 

(—r) It is clear (by looking at cr) that 

cr h 6»(S«0, S ? 0, S #cr ) -► ^(S^O). 

And, by (3), 

A e l-0(S 9 O, S^O, S #ff 0). 

Hence 

A e \ cj L ft(S t<r 0), 

which gives half of (2). 

(<—) The sentence in (3) implies 

ft(S &a 0) —> [V v 3 (9 (S 9 0, S ? 0, v 3 ) ft(v 3 ))]. 

But the part in square brackets is just a. H 

(Sometimes the notation cr is used for S :o T). In this notation, the 
fixed-point lemma states that Ae L (cr <—r ftfta^)).) 

Our first application of this lemma does not concern the subtheory 
Cn Ae, and requires only the weaker fact that 

Hn [ct~/J(S : H))]. 
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Tarski Undefinability Theorem (1933) The set J) Th 93 is not defin¬ 
able in 93. 

Proof. Consider any formula fi (which you suspect might define 
L I’ll 93). By the fixed-point lemma (applied to -i /l) we have a 
sentence a such that 

Nn [o’ •*-> -i /f(S #CT 0)]. 

(If fi did define Jt Th 93, then a would indirectly say “I am false.”) 
Then 

Nn o A=A fcN /HS# ff 0), 

so either er is true but (its Godel number is) not in the set fi defines, 
or it is false and in that set. Either way a shows that fi cannot define 
tt Th93. H 

The above theorem gives at once the undecidability of the theory 
of 93: 

Corollary 35A (J Th 93 is not recursive. 

Proof. Any recursive set is (by Corollary 34B) definable in 93. 

H 


Godel Incompleteness Theorem (1931) If A c Th93 and J \A is re¬ 
cursive, then Cn A is not a complete theory. 

Thus there is no complete recursive axiomatization of Th 93. 

Proof. Since Ac Th 93, we have Cn A c Th 93. If Cn A is a com¬ 
plete theory, then equality holds. But if Cn A is a complete theory, 
then (JCn A is recursive (item 21 of the preceding section). And 
by the above corollary, JJ Th 93 is not recursive, H 

In particular, Cn A E is not a complete theory and so is not equal to 
Th 93. And the incompleteness would not be eliminated by the addition 
of any recursive set of true axioms. (By a recursive set of sentences we 
mean of course a set E for which jJE is recursive.) 

We can extract more information from the proof of Godel’s theorem. 
Suppose we have a particular recursive A C Th 93 in mind. Then by 
item 20 in Section 3.4 we can find a formula fi that defines U Cn A in 
93. The sentence a produced by the proof of Tarski’s theorem is (as we 
noted there) a true sentence not in Cn A. This sentence asserts that JJcr 
does not belong to the set defined by fi, i.e., it indirectly says, “I am 
not a theorem of A.” Thus AY- er, and of course AY —\ a as well. This 
way of viewing the proof is closer to Godel’s original proof, which did 
not involve a detour through Tarski’s theorem. For that matter, Godel’s 
statement of the theorem did not involve Th93; we have taken some 
liberties in the labeling of theorems. 
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Next we need a lemma which says (roughly) that one can add one 
new axiom (and hence finitely many new axioms) to a recursive theory 
without losing the property of recursiveness. 

Lemma 35B If j Cn E is recursive, then ft Cn(E; r) is recursive. 

Proof, a e Cn(E; r) (r —► a) e Cn E. Thus 

a e JjCn(E; r) <f=)> a is the Godel number of a sentence 

and (/;(()) * |Jt * (h( —»)) * a * (h ())) 
is in Jt Cn E. 

This is recursive by the results of the preceding sections. H 

Theorem 35C (Strong Undecidability of Cn A e ) Let T be a the¬ 
ory such that T U A E is consistent. Then "JT is not recursive. 

(Notice that because throughout this section the language in question 
is the language of (ft, the word “theory” here means “theory in the 
language of (ft.”) 

Proof. Let T' be the theory Cn (T U A E ). If JT is recursive, then 
since A E is finite we can conclude by the above lemma that ft T' 
is also recursive. 

Suppose, then, that $T' is recursive and so is represented in 
Cn A e by some formula fi. From the fixed-point lemma we get a 
sentence a such that 

Ae l~ [er -o ■ >3 (S Bcr 0)]. (*) 

(Indirectly a asserts, “I am not in T'.”) 

cr tr => fat t tr 

=> A e h -.^(S^O) 

=^> A e F a by (*) 

a e V. 

So we get a e T'. But this, too, is untenable: 

cr e r => tto- e tt T' 

=> A e h ^(S» ff 0) 

=^> Ae I-' cr by (*) 

=► (--O’)e r, 

which contradicts the consistency of T'. H 

Corollary 35D Assume that flE is recursive and E U A E is con¬ 
sistent. Then Cn E is not a complete theory. 

Proof. A complete recursively axiomatizable theory is recursive 
(item 21 of Section 3.4). But JJ Cn E is not recursive, by the above 
theorem. H 

This corollary is Godel’s incompleteness theorem again, but with 
truth in 03 replaced by consistency with A E ■ 
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Church's Theorem (1936) The set of Godel numbers of valid sen¬ 
tences (in the language of 91) is not recursive. 

Proof. In the strong undecidability of Cn ,4 E , take T to be the 
smallest theory in the language, the set of valid sentences. H 

The set of Godel numbers of valid wffs is not recursive either, lest 
the set of valid sentences be recursive. 

This proof applies to the language of 91. For a language with more 
parameters, the set of valid sentences is still nonrecursive (lest its inter¬ 
section with the language of 91 be recursive). Actually it is enough for 
the language to contain at least one two-place predicate symbol. (See 
Corollary 37G.) On the other hand, some lower bound on the language 
is needed. If the language has V as its only parameter (the language of 
equality), then the set of valid formulas is decidable. (See Exercise 6.) 
More generally, it is known that if the only parameters are V and one- 
place predicate symbols, then the set of valid formulas is decidable. 

Recursive Enumerability 

A relation on the natural numbers is said to be recursively enumerable 
iff it is of the form 


{a | 3 b (a, b) e Q} 

with Q recursive. Recursively enumerable relations play an important 
role in logic. They constitute the formal counterpart to the effectively 
enumerable relations (as will be explained presently). 

(The standard abbreviation for “recursively enumerable” is “r.e.” 
When the term “computable” is used instead of “recursive,” then one 
speaks of computably enumerable — abbreviated c.e. —relations.) 

Recursively enumerable relations are — like the recursive relations 
— definable in 91. If ip(v\, Vi) defines in 91 a binary relation Q, then 
3 V 2 (p(v i, Vo) defines {« | 3 b (a, b) e 21- 

Theorem 35E The following conditions on an m- ary relation R are 
equivalent: 

1. R is recursively enumerable. 

2. R is the domain of some recursive relation Q. 

3. For some recursive (m + l)-ary relation Q, 

R = {(fli,- a m ) | 3 b {a\ -- a m , b) e Q}. 

4. For some recursive (m + n)-ary relation Q , 

R = {(fli, .. | 3b u ...,b„ («!-- a m , b x -- b n ) e Q }. 

Proof. By definition 1 and 3 are equivalent. Also 2 and 3 are equiv¬ 
alent by our definition (in Chapter 0) of domain and (/« + l)-tuple. 



Chapter 3: Undecidability 


239 


Clearly 3 implies 4. So we have only to show that 4 implies 3. 
This is because 

3b u - b n (a i -- a m ,bi ,..., b„) e Q 

iff 3c (ai,..., a m , (c) 0 ,..., (c)„_i) e Q 

and 

{(«!..., a m , c) | (ai,..., a m , (c) 0 -, (c)„_i) e Q} 

is recursive whenever Q is recursive. (Here we have used our 
sequence decoding function to collapse a string of quantifiers into 
a single one.) H 

By part 4 of this theorem, R is recursively enumerable iff it is defin¬ 
able in 91 by a formula 3 x\ • • • 3 x n <p, where <p is numeralwise deter¬ 
mined by A e . In fact, we can require here that <p be quantifier-free; this 
result was proved in 1961 (with exponentiation) and in 1970 (without 
exponentiation). The proofs involve some number theory; we will omit 
them here. 

Notice that any recursive relation is also recursively enumerable. For 
if R is recursive, then it is defined in 91 by a formula 3x\ ■ ■ ■ 3 x n <p, 
where (p is numeralwise determined by Ae and x\ ,..., x n do not occur 
in (p. 

Theorem 35F A relation is recursive iff both it and its complement 
are recursively enumerable. 

This is the formal counterpart to the fact (cf. Theorem 17F) that a 
relation is decidable iff both it and its complement can be effectively 
enumerated. 

Proof. If a relation is recursive, then so is its complement, whence 
both are recursively enumerable. 

Conversely, suppose that both P and its complement are re¬ 
cursively enumerable; thus for any ci, 

a e P <s> 3b (3, b) e Q 
a P <£> 3b (a, b) e R 

for some recursive Q and R. Let 

f(a) = the least b such that either (a, b) e Q or (a, b) e R. 

Such a number b always exists, and / is recursive. Finally, 
a e P (a, f(a )) e Q, 

so P is recursive. H 

The recursively enumerable relations constitute the formal counter¬ 
part of the effectively enumerable relations. For we have the following 
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informal result, paralleling a characterization of recursive enumerability 
given by Theorem 35E. 

*Lemma 35G A relation is effectively enumerable iff it is the do¬ 
main of a decidable relation. 

Proof. Assume that Q is effectively enumerated by some proce¬ 
dure. Then a e Q iff 3 n[a appears in the enumeration in n steps]. 
The relation defined in square brackets is decidable and has do¬ 
main Q. 

Conversely, to enumerate {(a,b) |3 n(a,b,n) e R} for de¬ 
cidable R, we check to see if ((/«)o, (m)i, (m) 2 } e R for m = 
0, 1,2,.... Whenever the answer is affirmative, we place {(m) 0 , 
(m) i> on the output list. H 

*Corollary 35H (Church's Thesis, Second Form) A relation is 
effectively enumerable iff it is recursively enumerable. 

Proof. By identifying the class of decidable relations with the class 
of recursive relations, we automatically identify the domains of 
decidable relations with domains of recursive relations. H 

The second form of Church’s thesis is, in fact, equivalent to the first 
form. To prove the first form from the second, we use Theorems 35F 
and 17F. 

We have already shown that a recursively axiomatizable theory is 
recursively enumerable, but using different words. We restate the result 
here, as it indicates the role recursive enumerability plays in logic. 

Theorem 351 If A is a set of sentences such that f )A is recursive, 
then [J Cn A is recursively enumerable. 

Proof. Item 20 of Section 3.4. H 

In particular, ft Cn A E is recursively enumerable, but (by Theorem 
35C) it is not recursive. In the next section, we will see other examples 
of recursively enumerable sets that are not recursive. 

This theorem is the precise counterpart of the informal fact that a 
theory with a decidable set of axioms is effectively enumerable (Corol¬ 
laries 25F and 261). It indicates the gap between what is provable in 
an axiomatic theory and what is true in the intended structure. With 
a recursive set of axioms, all we can possibly obtain is a recursively 
enumerable set of consequences. But by Tarski’s theorem, ThOI is not 
even definable in fR, much less recursively enumerable. 

Even if we expand the language or add new axioms, the same 
phenomenon is present. As long as we can recursively distinguish de¬ 
ductions from nondeductions, the set of theorems can be only recur¬ 
sively enumerable. For example, the set of sentences of number theory 
provable in your favorite system of axiomatic set theory is recursively 
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enumerable. Furthermore, this set includes A E and is consistent (un¬ 
less you have very strange favorites). It follows that this set theory is 
nonrecursive and incomplete, (This topic is discussed more carefully in 
Section 3.7.) 

Weak Representability 

Consider a recursively enumerable set Q , where 
a e Q 3b (a, b) e R 

for recursive R. We know there is a formula p that represents R in 
Cn A E . Consequently, the formula 3v2p defines Q in 9L This formula 
cannot represent Q in Cn A e unless Q is recursive. But it can come 
halfway. 

a G Q {a. b) £ R 

A e h p(S fl 0, S 6 0) 

=>• A e F 3 t> 2 p(S fl 0, v 2 ) 
a £ Q =>■ (a.b) £ R 

A e h -.p(S fl 0,S fe 0) 

=A> A e ^ 3 v 2 p{ S fl 0, v 2 ) 

The last step is justified by the fact that if A E h -> ^(S^O) for all 
b, then A E )A 3.v cp{x). (The term co-consistency is given to this prop¬ 
erty.) For it is impossible for 3 x cp(x), —< y?(S°0), —i ^(S'O),... all to be 
true in 91 

Thus we have 


for some b 
for some b 

for all b 
for all b 


a e Q A e F 3 ibp(S fl 0, v 2 ). 

It will be convenient to formulate a definition of this half of repre¬ 
sentability. 

Definition. Let Q be an n-ary relation on N, \j/ a formula in which 
only Vi,... ,v n occur free. Then i// weakly represents Q in a 
theory T iff for every ai,... ,a„ in 94, 

a n ) eg o S ai 0,..., S fl "0) e T. 

Observe that if Q is representable in a consistent theory T, then Q 
is also weakly representable in T. 

Theorem 35J A relation is weakly representable in Cn A E iff it is 
recursively enumerable. 

Proof. We just showed that a recursively enumerable unary rela¬ 
tion Q is weakly representable in Cn A E \ the same proof applies 
to n-ary Q with only notational changes. Conversely, let Q be 
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weakly represented by i// in Cn A E . Then 

(fli,..., a n ) e Q <£> 3D [ D is a deduction of \J/(S ai 0 .S fl, '0) 

from the axioms A E ] 

3d (d, a,,)) e P 

for a certain recursive function / and recursive relation P. -\ 

Arithmetical Hierarchy 

Define a relation on the natural numbers to be arithmetical iff it is 
definable in 9T But some arithmetical relations are, in a sense, more 
definable than others. We can organize the arithmetical relations into a 
hierarchy according to how definable the relations are. 

Let Ei be the class of recursively enumerable relations; these rela¬ 
tions are “one quantifier away” from recursiveness. Extending this idea, 
we define the class of Y. k relations and the class of 11/ relations. For 
example, the first few classes consist of relations of the form shown in 
the second column: 


Li : 

{a 

3b {d,b) e R}, 

R recursive 

n, : 

[a 

Wb (a, b) e Rj, 

R recursive 

E 2 : 

{a 

3cVb (a, b, c) e Rj, 

R recursive 

n 2 : 

{a 

Vc3b (a, b, c) e R }, 

R recursive 


In general, a relation Q is in iff it is of the form 
[a | V/?i3Z?2 • • ■ Ob k (a, b) e R} 

for a recursive relation R. Here is to be replaced by “V” if k is odd 
and by “3” if k is even. Similarly, Q is in E/, iff it has the form 

[a | 3b{ib 2 ---Ub k {a,b) e R] 

for recursive R , where now is replaced by “3” if k is odd and by 
“V” if k is even. 

The classes E/ f and 11/,. can also be defined by recursion on k. Ei is 
the class of recursively enumerable relations. Next, a relation belongs 
to n/ iff its complement is in E /; . And a relation is in E /<+l iff it is the 
domain of a relation in n/;.. (We can even start from k = 0, by letting 
Eq be the class of recursive relations.) 

Example. The set of Godel numbers of formulas numeralwise 
determined by Ae is in n 2 . 

Proof, a belongs to this set iff \a is the Godel number of a for¬ 
mula a] and Vb3d[d is Q of a deduction from A E either of 
a(S (fe) °0,S Wl 0,...) or of the negation of this sentence]. By the 
technique of Section 3.4, we can show that the phrases in 
square brackets define recursive relations. By using the English 
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counterpart to prenex form, we obtain the desired form, 

{a | Vb3d{a,b,d) e R }, 

with /t recursive. H 

One more bit of notation: Let A\ be the class of recursive relations. 
Then our earlier result (Theorem 35F) stating that a relation is recursive 
iff both it and its complement are recursively enumerable can now be 
stated by the equation 

Ai = Si 0 TTi. 

Since this equation holds, we proceed to define A„ for n > 1 by the 
analogous equation, 

a n — n n„. 

The following inclusions hold: 


0 / 0 / 0 / 

Ai zJ 2 A z 

'S' 0/ O/ 

u x n 2 n 3 


The case A] C Ei was mentioned previously (cf. Theorem 35F); its 
proof hinged on the possibility of “vacuous quantification.” The proofs 
of the other cases are conceptually the same. If x does not occur in 
<p, then (p, \/xcp, and 3x<p are all equivalent. For example, a rela¬ 
tion in Ei is defined by a formula 3 y cp , where tp is numeralwise de¬ 
termined by A But the same relation is defined by 3 y V v (p and 
V x 3 y q> (where x does not occur in <p). Hence the relation is also in E 2 
and n 2 . 

It is also true that all the inclusions shown are proper inclusions, 
i.e., equality does not hold. But we will not prove this fact here. The 
inclusions are shown pictorially in Fig. 10. 

The class of arithmetical relations equals |J 4 E^. and also |J t TL. For 
example, any relation in E 2 is arithmetical, being defined in 01 by a for¬ 
mula 3 x V y q>, where (p is numeralwise determined by Ae ■ Conversely, 
any arithmetical relation is defined in fR by some prenex formula. The 
quantifier-free part of this prenex formula defines a recursive relation 
(since quantifier-free formulas are numeralwise determined by Ae). 
Consequently, the defined relation falls somewhere in the hierarchy. 
The technique of “collapsing” 33 ■ ■ ■ 3 quantifiers used in the proof of 
Theorem 35E (and its dual technique for W ■ ■ • V) can be used to good 
advantage here. 
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Thus we have the following result, which relates definability in TI to 
the hierarchy we have just built up from the recursive relations: 

Theorem 35K A relation on the natural numbers is arithmetical (i.e., 
definable in 9T) iff it is in E* for some k, and this property in turn 
is equivalent to being in 11/ for some /. 

In particular, any recursively enumerable relation is arithmetical, as 
noted previously. 

There are certain tricks which are useful in locating specific arith¬ 
metical relations in the hierarchy. For example, let A be the set of Godel 
numbers of formulas a such that for some n, 

Ae F a(S"0) and (Vi < n) Ae I-ia(S'0). 

Then a e A iff [a is the Godel of a wff a] and 3 n3D[D is a deduction 
from A e of a(S"0)] and (Vi < n)(3D,)[D, is a deduction from A E of 
—i ff(S'O) |. The parts in square brackets are recursive so we count the 
remaining quantifiers. The bounded quantifier “Vi < n” need not be 
counted. For we have 

(Vi < n)(3d)(d, i) e P <S> (3d)(Vi < n)((d)i , i) € P. 

Use of this fact lets us push the bounded quantifier inward until it merges 
with the recursive part. Consequently, A e Si. 

The following theorem generalizes Theorem 351. 
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Theorem 35L Let A be a set of sentences such that £4 is in 
where k > 0. Then £ Cn A is also in E/ ; . 

Proof. Return to the proofs of items 18 and 20 of Section 3.4. We 
had there: 

a e £ Cn A <£> a is the Godel number of a sentence and 3 d\d is a 

sequence number and the last component of d is a and 
for every i less than lh d, either (1) (d) l e £A, (2) (d) t 
is the Godel number of a logical axiom, or (3) for some 
j and l less than i, ( d)j = (/;(()) * (d)i * ( h( — ► )) 

*{d)i * </;()))]• 

Since £4 e in (1) we must replace “(d); e £4” by something 
of the form 

3 biVb2---nb k mi,b) e Q 

for recursive Q. It remains to convert the result into an English 
prenex expression in E* form. We suggest that the reader set k = 2 
and write out this expression; the device used in the preceding 
example will help. H 


Exercises 

1. Show that there is no recursive set R such that £ Cn Ae C R and 
£{cr | (—i er) e Cn Ae} C R. the complement of R. (This result can 
be stated: The theorems of A E cannot be recursively separated from 
the refutable sentences.) Suggestion: Make a sentence a saying “My 
Godel number is not in R.” Look to see where £er is. 

2. Let 4 be a recursive set of sentences in a recursively numbered 
language with 0 and S. Assume that every recursive relation is rep¬ 
resentable in the theory Cn 4. Further assume that 4 is ('^-consistent; 
i.e., there is no formula tp such that 4 h 3 x cp(x) and for all a e N, 

4 1- 1 <p(S"0). Construct a sentence a indirectly asserting that it 

is not a theorem of 4, and show that neither 4 h a nor 4 1-a. 

Suggestion: See Section 3.0. 

Remark: This is a version of the incompleteness theorem that 
is closer to Godel’s original 1931 argument. Note that there is no 
requirement that the axioms 4 be true in fR. Nor is it required that 
4 include 4^; the fixed-point argument can still be applied. 

3. Let T be a theory in a recursively numbered language (with 0 and S). 
Assume that all recursive subsets of N are weakly representable in 
T. Show that £T is not recursive. Suggestion: Construct a binary 
relation P such that any weakly representable subset of N equals 
[b | (a, b) e P] for some a, and such that P is recursive if £7 is. 
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Consider the set H = {b\ (b, b) ^ P}. See Section 3.0. The argu¬ 
ment given there for the “diagonal approach” in the special case 
T = I’ll Of can be adapted here. 

Remark : This exercise gives a version of the result, “Any suffi¬ 
ciently strong theory is undecidable.” 

4. Show that there exist 2 K(I nonisomorphic countable models of Th 07. 
Suggestion: For each set A of primes, make a model having an 
element divisible by exactly the primes in A. 

5. (Lindenbaum) Let T be a decidable consistent theory (in a rea¬ 
sonable language). Show that T can be extended to a complete 
decidable consistent theory T'. Suggestion: Examine in turn each 
sentence a: add either o or —i a to 7’. But take care to maintain 
decidability. 

6 . Consider the language of equality, having V as its only parameter. 
Let k„ be the translation of “There are at least n things,” cf. the 
proof of Theorem 26A. Call a formula simple iff it can be built up 
from atomic formulas and the a„’s by use of connective symbols 
(but no quantifiers). Show how, given any formula in the language 
of equality, we can find a logically equivalent simple formula. Sug¬ 
gestion: View this as an elimination-of-quantifiers result (where the 
quantifiers in X n do not count). Use Theorem 3 IF. 

7. (a) Assume that A and B are subsets of N belonging to E* (or 

n*). Show that A U B and A fl B also belong to E* (or TL, 
respectively). 

(b) Assume that A is in E/, (or FL) and that the functions f \,..., 
/,„ are recursive. Show that 

{a : </i(2),..., f„(a)) e A} 

is also in E* (or F4, respectively). Suggestion: First do this for 
Ei. Then observe that the argument used can be generalized. 

8 . Let T be a theory in a recursively numbered language (with 0 and 
S). Let n be fixed, n > 0. Assume that all subsets of N in E„ are 
weakly representable in T. Show that #T is not in n„. (Observe that 
Exercise 3 is a special case of this, wherein n — 0. The suggestions 
given there carry over to the present case.) 

9. Show that 

{(Jcr | A E ; a is coconsistent} 

(see Exercise 2) is a IT 3 set. 

10. The theory Cn A E has many complete extensions, of which Th s Jt 
is but one. How many? That is, what is the cardinality of the set of 
complete theories (in the language) that extend A E ? 
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SECTION 3.6 _ 

Recursive Functions 

We have used recursive functions (i.e., functions that, when viewed 
as relations, are recursive) to obtain theorems of incompleteness and 
undecidability of theories. But the class of recursive functions is also 
an interesting class in its own right, and in this section we will indicate 
a few of its properties. 

Recall that by Church’s thesis, a function is recursive iff it is com¬ 
putable by an effective procedure (page 210). This fact is responsi¬ 
ble for much of the interest in recursive functions. At the same time, 
this fact makes possible an intuitive understanding of recursiveness, 
which greatly facilitates the study of the subject. Suppose, for exam¬ 
ple, that you are suddenly asked whether or not the inverse of a re¬ 
cursive permutation of N is recursive. Before trying to prove this, you 
should first ask yourself the intuitive counterpart: Is the inverse of a 
computable permutation / also computable? You then — one hopes — 
perceive that the answer is affirmative. To compute / -1 (3), you can 
compute /(0), /(l), ... until for some k it is found that f(k) = 3. 
Then / - 1 (3) = k. Having done this, you have gained two advan¬ 
tages. For one, you feel certain that the answer to the question re¬ 
garding recursive permutations must also be affirmative. And secondly, 
you have a good outline of how to prove this; the proof is found 
by making rigorous the intuitive proof. This strategy for approach¬ 
ing problems involving recursiveness will be very useful in this 
section. 

Before proceeding, it might be wise to summarize here some of the 
facts about recursive functions we have already established. We know 
that a function / is recursive iff it (as a relation) is representable in 
Cn Ae, by Theorem 34A. Consequently, every recursive function is 
weakly representable in this theory. 

In Section 3.3 a repertoire of recursive functions was developed. In 
addition, it was shown that the class of recursive functions is closed 
under certain operations, such as composition (Theorem 33L) and the 
“least-zero” operator (Theorem 33M). 

We also know of a few functions that are not recursive. There are 
uncountably many (to be exact, 2 ^°) functions from N m into N altogether, 
but only countably many of them can be recursive. So an abundance of 
nonrecursive functions exists, despite the fact that the most commonly 
met functions (such as the polynomials) were shown in Section 3.3 to be 
recursive. By catalog item 1 of Section 3.3, the characteristic function of 
a nonrecursive set is nonrecursive. For example, if f(a) — \ whenever 
a is the Godel number of a member of Cn ,4 E and f(a) = 0 otherwise, 
then / is not recursive. 
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Normal Form 

For any computable function, such as the polynomial function a 2 + 3a + 
5, one can in principle design a digital computer into which one feeds 
a and out of which comes a 2 + 3a + 5 (Fig. 11). But if you then want 
a different function, you must build a different computer. (Or change 
the wiring in the one you have.) It was recognized long ago that it is 
usually more desirable to build a single general-purpose stored-program 
computer. Into this you feed both a and the program for computing your 
polynomial (Fig. 12). This “universal” computer requires two inputs, 
and it will compute any one-place computable function (if supplied 
with enough memory space), provided that the right program is fed 
into it. Of course, there are some programs that do not produce any 
function on N, as many a programmer has, to his sorrow, discovered. 
(Such programs produce malfunctions instead!) 

a 


a 1 + 3a + 5 

Figure 11. Special-purpose 
computer. 


a 2 + 3a + 5 

Figure 12. General-purpose computer. 

In this subsection and the next, we will repeat what has just been 
said, but with recursive functions and with proofs. For our universal 
computer we will have a recursive relation 7) and a recursive function 
U. Then for any recursive / : N —»■ N there will exist an e (analogous 
to the program) such that 

f(a ) = U (the least k such that (e , a, k) e 7j) 

= U()ik ( e , a, k) e 7j), 

where the second equation is to be understood as being an abbreviation 
for the first. Actually e will here be the Godel number of a formula 
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(p that represents (or at least weakly represents) / in Cn A E . And the 
numbers k for which (e, a. k) e T\ will encode both f(a) and Q of a 
deduction from A E of <p(S"0. S /( "’0). 

Definition. For each positive integer m, let T m be the (in + 2)-ary 
relation to which an (m + 2)-tuple (e,a\,..., a m , k) belongs iff 

(i) e is the Godel number of a formula (p in which only v\,..., 
v m , v m+ \ occur free; 

(ii) k is a sequence number of length 2, and (k) o is Q of a 

deduction from A E of ip (S" 1 0.S"" : 0. S (,I)| 0). 

The idea here is that for any one-place recursive function / we 
can first of all take e to be the Godel number of a formula cp weakly 
representing / (as a relation). Then we know that for any a and b, 

A e h <p(S fl 0, S*0) iff b — f(a). 

So any number k meeting clause (ii) of the definition must equal 
((k) o, f(a)), where (k) o is Q of a deduction of <p(S fl 0, S^ (fl) 0) from 
A e . (We have departed from the usual definition of T m here by not 
requiring that k be as small as possible.) 

Take for the “upshot” function U the function 

U(k) = (k) i. 

This U is recursive and in the situation described in the preceding para¬ 
graph we have U ( k ) = f(a). 

Lemma 36A For each m, the relation T m is recursive. 

Proof, for m— 2. (e, a \, « 2 , k) e T 2 iff e is the Godel number of 

a formula, #(V Ui V V 2 V vj) * e is the Godel number of a sentence, 
k is a sequence number of length 2, and (A:)o is Q of a deduction 
from A e of 

Sb(Sb(Sb(e, JJui, g(ai)), tJu 2 , g(a 2 )), tlV3, g((k) 1 )), 

where g(n) = (JS''0. From Section 3.4 we know all this to be 
recursive. H 

Theorem 36B (a) For any recursive function / : N m —> N, there is 

an e such that for all rq, ..., a m , 

f(a\ -- a m ) = U(pik (e. a 1 ,..., a m , k) e T m ). 

(In particular, such a number k exists.) 

(b) Conversely, for any e such that Vci\ ■ ■ ■ a m 3k (e,a 1 ,..., 
a m , k) e T m , the function whose value at a \,..., a m is U(ptk(e, 
a 1 ,..., a m , k) e T m ) is recursive. 
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Proof. Part (b) follows immediately from the fact that U and T m 
are recursive. As for part (a), we take for e the Godel number of 
a formula <p weakly representing / in Cn ,4 E . Given any a, we 
know that A E h ^(S fll 0, ..., S""'0, S^ (a) 0). If we let d be Q of a 
deduction from A E of this sentence, then {e , a , (d, f(a))) e T m . 
Hence there is some k for which (e, a, k) e T,„. And for any such 
k, we know that A E h ^(S fll 0,..., S"" : (). S (fc)l 0), since (k) 0 is Q 
of a deduction. Consequently, U (k) — (k)\ — f (a) by our choice 
of <p. Thus we have U(p.k { e , a, k) e T m ) — f(a). H 

This theorem, due to Kleene in 1936, shows that every recursive 
function is representable in the normal form 

f(d) = U()ik {e , a, k) e T m ). 

Thus a computing machine able to calculate U and the characteristic 
function of T\ is a “universal” computer for one-place recursive func¬ 
tions. The input e corresponds to the program, and it must be chosen 
with care if any output is to result (i.e., if there is to be any k such that 
(. e, a, k) e T\). 

Recursive Partial Functions 

The theory of recursive functions becomes more natural if we consider 
the broader context of partial functions. 

Definition. An m-place partial function is a function / with 
dom / C N m and ran / C N. If a £ dom /, then f(d) is said to 
be undefined. If dom / = N m , then / is said to be total. 

The reader is hereby cautioned against reading too much into our 
choice of the words “partial” and “total” (or the word “undefined,” for 
that matter). A partial function / may or may not be total; the words 
“partial” and “total” are not antonyms. 

We will begin by looking at those partial functions that are informally 
computable. 

^Definition. An m-place partial function / is computable iff there 
is an effective procedure such that (a) given an m -tuple a in dom /, 
the procedure produces /(a); and (b) given an m -tuple a not in 
dom /, the procedure produces no output at all. 

This definition extends the one previously given for total functions. 
At that time we proved a result (Theorem 33H), part of which generalizes 
to partial functions. 

*Theorem 36C An m-place partial function / is computable iff / 
(as an (m + l)-ary relation) is effectively enumerable. 
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Proof. The proof is reminiscent of the proof of another result. 
Theorem 17E. First suppose we have a way of effectively enu¬ 
merating /. Given an m-tuple a, we examine the listing of the 
relation as the procedure churns it out. If and when an (m + 1)- 
tuple beginning with a appears, we print out its last component 
as f(a). 

Conversely, assume that / is computable, and first suppose 
that / is a one-place partial function. We can enumerate f as a 
relation by the following procedure: 

1. Spend one minute calculating /(0). 

2. Spend two minutes calculating /(0), then two minutes cal¬ 
culating /(1). 

3. Spend three minutes calculating /(0), three minutes calcu¬ 
lating /(1), and three minutes calculating /(2). 

And so forth. Of course, whenever one of these calculations 
produces any output, we place the corresponding pair on the list 
of members of the relation /. 

For a computable m -place partial function, instead of calcu¬ 
lating the value of / at 0, 1, 2, ... we calculate its value at ((0)o, 
• • • , (0) m -,>, (G)o, • ■ • , (l)m-l). <(2)o, ■ . ■ , (2)m-l>, etc. H 

In the case of a computable total function /, we were also able to 
conclude that / was a decidable relation. But this may fail for a nontotal 
/. For example, let 

f(a) = { ° iffl G tJCn A e , 

' y undefined otherwise. 

Then / is computable. (We compute f(a ) by enumerating JJCn A E 
and looking for a.) But / is not a decidable relation, lest fJCn A E be 
decidable. On the basis of this example and the foregoing theorem, 
we select our definition for the precise counterpart of the concept of 
computable partial function. 

Definition. A recursive partial function is a partial function that, 
as a relation, is recursively enumerable. 

The reader should be warned that “recursive partial function” is an 
indivisible phrase; a recursive partial function need not be (as relation) 
recursive. But at least for a total function our terminology is consistent 
with past practice. 

Theorem 36D Let / :N" ! -> Nbea total function. Then / is a 
recursive partial function iff / is recursive (as a relation). 

Proof. If / is recursive (as a relation), then a fortiori f is re¬ 
cursively enumerable. Conversely, suppose that / is recursively 
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enumerable. Since / is total, 

f(a) ± b 3 c[f(a) = c & b =£ c]. 

The form of the right-hand side shows that the complement of 
/ is also recursively enumerable. Thus by Theorem 35F, / is 
recursive. H 

In first discussing normal form results, we pictured a two-input de¬ 
vice (Fig. 13). For any computable partial function, there is some pro¬ 
gram that computes it. But now the converse holds: Any program will 
produce some computable partial function. Of course many programs 
will produce the empty function, but that is a computable partial func¬ 
tion. 


program 



f(a) 


Figure 13. Computer with program for /. 


For the recursive partial functions the same considerations apply. 

Define, for each e e N, the m -place partial function [e]] m by 

Qr]]m (i ■ ■.., a m ) — U {fik (r?, cii ,..., a in , k) £ T m ). 

The right-hand side is to be understood as undefined if there is no such 

k. In other words, 

a e domje]],,, iff 3 k (e, ai,, a m , k) e T m , 

in which case the value dej,,, (a) is given by the above equation. 

The following theorem is an improved version of Theorem 36B: 

NoRMAt Form Theorem (Kleene, 1943) (a) The (m + l)-place par¬ 

tial function whose value at (e, a \,..., a m ) is [[e]],„(fli,..., a m ) 
is a recursive partial function. 

(b) For each e>0, ||c[|,„ is an m-place recursive partial 
function. 

(c) Any w-place recursive partial function equals [e]] m for 
some e. 

Proof, (a) We have 

M m (a) = b<£> 3 k[(e, a, k) e T m & U(k) = b &(Vk' < k)(e,a,k') ^ T m ], 
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The part in square brackets is recursive, so the function (as a 
relation) is recursively enumerable. 

(b) The above proof still applies, e now being held fixed. 

(c) Let / be an in -place recursive partial function, so that 

{(3, b) | f(a) = b} is recursively enumerable. Hence there is a 
formula tp that weakly represents this relation in Cn A E . We claim 
that f — UipJ m . For if f(ci ) = b , then A E h <p(S fll 0, ..., S a '"0, 
S fe 0). Hence there is a k such that (fl <p, a, k) e T m . For any such 
k, U(k) — b, since A E )A tp{ S fll 0,..., S" m 0, S c 0) for c ^ b. Sim¬ 
ilarly, if f(a) is undefined, then A £ F ^(S fll 0, ..., S"'“0. S' 0) for 
any c, whence 0<p]]„, is undefined here also. H 

Part (a) of the normal form theorem (in the case m = 1) tells us that 
the function O defined by the equation 

<J>(e, a) — [eJhfa) = U(ptk ( e , a, k) e T\) 

is a recursive partial function. And part (c) tells us that <t> is “universal” 
in the sense that we can get any one-place recursive partial function 
from <J> by holding the first variable fixed at a suitable value. 

The informal counterpart of the universal function <J> is the computer 
operating system. The operating system takes two inputs, the program 
e and the data a. And it runs the program on that data. But the operating 
system itself is computable as a two-place partial function. 

The proof of normal form theorem gives us a way to compute the 
values of our “operating system” <l>, albeit in an extremely inefficient 
way. The straightforward idea of “looking at the program e and doing 
what it says to the data a ” has been obscured, to say the least. 

The function |[e]],„ is said to be the m -place recursive partial function 
with index e. Part (c) of the normal form theorem tells us that every 
recursive partial function has an index. The proof shows that the Godel 
number of a formula weakly representing a function is always an index 
of the function. 

We now have a convenient indexing [OJi, IlJi, ... of all the one- 
place recursive partial functions. Function [[<?]] i is produced by the “in¬ 
structions” encoded by e. Of course, that function will be empty unless 
e is the Godel number of a formula and certain other conditions are met. 

All the recursive total functions are included in our enumeration of 
recursive partial functions. But we cannot tell effectively by looking at 
a number e whether or not it is the index of a total function: 

Theorem 36E {e \ [eJi is total} is not recursive. 

Proof. Call this set A. Consider the function defined by 


f(a ) = 


[Mi (a) + 1 if a e A, 
0 if a ^ A. 
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Then /, by its construction, is total. Is it recursive? We have 

f(a ) = b [(a ^ A & b = 0) or (a e A & 3 k((a, a , A:) e T\ 
&b= U(k) + 1 & (Vj < k){a, a, j) / 7T))]. 

Thus if A is recursive, then / (as a relation) is recursively enu¬ 
merable, But then / is a total recursive function, and so equals 
[eji for some e e A. But/(e) = [e]]i(e)-|-1, so we cannot have 
/ = [ej |. This contradiction shows that A cannot be recursive. 

H 


It is not hard to show that A is in IT 2 . This classification is the best 
possible, as it can be shown that A is not in 5/. 

Theorem 36F The set 


K = {a | [Mi (a) is defined} 


is recursively enumerable but not recursive. 


Proof. K is recursively enumerable, since a e K <£> 3 k (a, a, k) 
e T\. To see that K cannot be recursive, consider the function 
defined by 


8(a) = 


|[a]]i (a) + 1 if a e K, 
0 if a £ K. 


This is a total function. Exactly as in the preceding theorem, we 
have that K cannot be recursive. H 


Corollary 36G (Unsolvability of the Halting Problem) The re¬ 
lation 


{(e, a) | Mila) is defined} 
is not recursive. 

Proof. We have a e K iff the pair (a, a) belongs to this relation. 
(Thus the problem of membership in K is “reducible” to the halt¬ 
ing problem.) If this relation were recursive, then K would be, 
which is not the case. H 

This corollary tells us that there is no effective way to tell, given a 
program e for a recursive partial function and an input a, whether or not 
the function [[e]]i is defined at a. 

We can obtain an indexing of the recursively enumerable relations 
by using the following characterization. 

Theorem 36H A relation on N is recursively enumerable iff it is 
the domain of some recursive partial function. 


Proof. The domain of any recursively enumerable relation is also 
recursively enumerable; cf. part 4 of Theorem 35E. In particular. 
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the domain of any recursive partial function is recursively enu¬ 
merable. 

Conversely, let Q be any recursively enumerable relation, 
where 

a = Q 3b (a, b) e R 
with R recursive. Let 

f(a) = lib (a, b) e R\ 

i.e., 

f(a) = b (a, b) e R & (Vc < b){a, c) R. 

Then /, as a relation, is recursive. Hence / is a recursive partial 
function. Clearly its domain is Q. H 

Thus our indexing of the recursive partial functions induces an in¬ 
dexing of the recursively enumerable relations. Define 

W e = dom[[e]]i. 

Then Wq, W\ , W 2 ,... is a list of all recursively enumerable subsets 
of N. In Theorem 36E we showed that {e\ W e — N} is not recursive. 
Similarly, Theorem 36F asserts that {e \ e e W e } is not recursive. Define 
a relation Q by 

Q = {(e, a) | a e W e }. 

Then Q is recursively enumerable, since (e, a) e Q <£> 3 k ( e , a, k) e T\. 
Furthermore, Q is universal for recursively enumerable sets, in the sense 
that for any recursively enumerable ACM there is some e such that 
A = {« | (e, a) e Q}. The unsolvability of the halting problem can be 
stated: Q is not recursive. 

We can apply the classical diagonal argument to “diagonalize out” 
of the list Wo, W\, W 2 , ... of recursively enumerable sets. The set 
{a | a W„} cannot coincide with any W q . In fact this set is exactly K , 
the complement of the set K in Theorem 36F. Because 

q & K s —>- q ^ Wq, 

the set K cannot equal any W q ; the number q witnesses the inequality 
of the two sets K and W q . 

And there is more: Whenever W q is a recursively enumerable subset 
of K, that is, W q c K, then we can produce a number in K that is not 
in W q . Such a number is q itself. To see this, observe that in the line 
displayed in the preceding paragraph we cannot have both sides false 
(q e K and q e W q ) because W q C K. So both sides are true. 

Theorem 36F asserts that K, although recursively enumerable, is 
not recursive. To show non-recursiveness, it suffices to show that its 
complement K is not recursively enumerable. The preceding paragraph 
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does this in a particularly strong way, thereby giving us a second proof 
of Theorem 36F. 

At this point, let us reconsider the Godel incompleteness theorem, 
from the computability point of view. 

The set K is recursively enumerable (i.e., E|). It follows (cf. Theo¬ 
rem 35K) that K is arithmetical; that is, K is definable in the structure 9c 

So there is a formula x ( v \ ) with just V \ free that defines K in 91. 
And so the set K is defined in 91 by the formula -i x ( v \ ). Thus we have 

aeK (-ix(S fl 0)) e Th9t. 

This fact tells how we can “reduce” questions about membership in the 
set K to questions about Th91. Imagine that we are given a number a, 
and we want to know whether or not a e K. We can compute the number 
tt( —i *(S a 0)). (Informally, it is clear that we can effectively compute this 
number. Formally, we apply item 5 from Section 3.4 to make sure we 
can recursively compute the number.) If we somehow had an oracle for 
ft Th91 (i.e., a magic device that, given a number, would tell us whether 
or not that number was in (J Th 91), then we could answer the question 
“Is a e KT 

Now let us eliminate the magic. For sets A and B of natural numbers, 
we say that A is many-one reducible to B (in symbols, A < m B) iff there 
exists a total recursive function / such that for every number a, 

a G A \ ■ v / iu) g B . 

The earlier example tells us that K < m J) Th 91. More generally, the ar¬ 
gument shows that any arithmetical set is many-one reducible to JJ Th 91. 

Lemma 361 Assume that A and B are sets of natural numbers with 
A < m B. 

(a) If B is recursive, then A is also recursive. 

(b) If B is recursively enumerable, the A is also recursively 
enumerable, 

(c) If B is E„ for some n, the A is also E„ for that n. 

Proof. Part (a) is already familiar; it was, in different terminology, 
catalog item 2 in Section 3.3. 

Part (b) is essentially part (a) “plus a quantifier.” That is, be¬ 
cause B is recursively enumerable, we know that for some recur¬ 
sive binary relation Q , 

c e B 3bQ(c,b). 

If / is the total recursive function that many-one reduces A to 
B, then every number a. 


a e A 


f(a ) e B 


3 b[Q(f(a),b)]. 
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The part in square brackets is recursive (i.e., {(a, b) \ Q(f(a), £>)} 
is recursive), as in part (a) of the lemma. So we have A in the 
required form to be recursively enumerable. 

Part (c) is essentially part (a) “plus n quantifiers” and is proved 
like part (b). H 

Our reason for examining the particular set K is that it gives us the 
following consequence: 

Godel Incompleteness Theorem Th hi is not recursively axiomatiz- 
able. 

Proof. ThfR cannot be recursively enumerable, lest K be recur¬ 
sively enumerable, by the preceding lemma. But any recursively 
axiomatizable theory would be recursively enumerable (item 20 
of Section 3.4; also Theorem 351). H 

In starkest terms, the situation is this: Any recursively axiomatizable 
theory is recursively enumerable. But Th fR is not recursively enumer¬ 
able. So any recursively axiomatizable subtheory must be incomplete. 

It will be worth while to go over this proof again, but replacing 
negative statements (such-and-such a set does not have a particular 
property) by positive statements. 

Assume that T is any recursively axiomatizable subtheory of Th 01. 
(So by the above theorem, T is incomplete.) We want to lay our hands 
on a sentence demonstrating the incompleteness. 

We have made a total recursive function / that many-one reduces K 
to JJ Th 01, namely f (a) = JJ( -i x(S fl 0)); then for every a, 

a eK f(a) e flThOl. 

And f(a) is (the Godel number of) the sentence saying “a ^ K .” 
Consider the set J of numbers defined by the condition 

Cl G J \ ■ V / (A) G jjr. 

Thus J is the set of numbers that T “knows” are not in K. There are 
two observations to be made concerning J : 

First, J is recursively enumerable. It is many-one reduced by / to 
the recursively enumerable set JIT; apply Lemma 361(b). 

Secondly, / C K. We have T C Th 01, so if T knows that a K, 
then really a <£ K: 

ae J f(a) e $T=>f(a) e JJ Th OT a eK. 

So / is a recursively enumerable subset of K. It is a proper subset, 
because K is not recursively enumerable. That is, there is some number 
q with q € K andg ^ J . Consequently, f(q ) e flThOlbut f(q) #T. 
That is, the sentence (—■ *:(S^O)) is true (in 03) but fails to be in T, 
thereby demonstrating the incompleteness of T. 



258 


A Mathematical Introduction to Logic 


And what does this sentence “say”? For q, we can take any number 
for which W q = J. Then q e K and q J. 

Here then is the situation: 

(-i ^(S ? 0)) says q £ K 
i.e., q <£. W q 

i.e., q £ J since W q = J 

i.e., f(q) ttr by definition of J 

i.e., T \f (—. pc(S^O)). 

The sentence we made to witness the incompleteness of T asserts its 
own unprovability in the axiomatizable theory T ! 

The computability approach and the self-reference approach to 
Godel’s incompleteness theorem are not so different after all. Moreover, 
the computability approach is close to the diagonalization approach 
(of Section 3.0), but with the diagonal argument moved to a different 
context. 

Reduction of Decision Problems 1 

Suppose we have a two-place recursive partial function /. Then we 
claim that, for example, the function g defined by 

g(a) = /(3, a ) 

is also a recursive partial function. On the basis of informal computabil¬ 
ity this is clear; one computes g by plugging in 3 for the first variable 
and then following the instructions for /. A proof can be found by for¬ 
malizing this argument. There is some formula <p = (p(v i, i> 2 , tb) that 
weakly represents / (as a relation) in Cn Ae . Then g is weakly repre¬ 
sented by <£>(S 3 0, vi, vi), provided that v\ and v 2 are substitutable in (p 
for v 2 and (If not, we can always use an alphabetic variant of <p.) 

Now all this is not very deep. But by standing back and looking at 
what was said, we perceive a more subtle fact. We were able to transform 
effectively the instructions for / into instructions for g. So there should 
be a recursive function that, given an index for / and the number 3, 
will produce an index for g. The following formulation of this fact is 
sometimes known by the cryptic name of “the S-m-n theorem.” 

Parameter Theorem For each m > 1 and n > 1 there is a recursive 
function p such that for any e, a, b, 

Mm+ntei, • • ■ ,a m , bu - b„) = n _p(e, a u .. .,a m )J„(bi,b„). 

(Equality here means of course that if one side is defined, then so also 
is the other side, and the values coincide. Sometimes a special symbol 
is used for this role.) 


1 The remainder of this section can be skipped on a first reading. 
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On the left side of the equation a consists of arguments for the func¬ 
tion [[e]],„ + „; on the right side a consists of parameters upon which 
the function |}p(e, a)],, depends. In the example we had m = n — 1 
and = 3. Since p depends on m and n, the notation “p™” would be 
logically preferable. But, in fact, we will use simply “p.” 

Proof, for m — n = 1 . It is possible to give a proof along the lines 
indicated by the discussion that preceded the theorem. But to avoid 
having to cope with alphabetic variants, we will adopt a slightly 
different strategy. 

We know from the normal form theorem that the three-place 
partial function h defined by 

h{e, a, b ) = JeJklP, b) 

is a recursive partial function. Hence there is a formula x[r that 
weakly represents h (as a relation). We may suppose that in i [r the 
variables v\ and v 2 are not quantified. We can then take 

pie, a) = S°0, v\, v 2 ) 

= Sb(Sb(Sb(Sb(«Vr, #“«!, «S e 0), ttv 2 , tfS fl 0), ttW 3 , $V 4 , $V 2 ). 

Then pie, a) is the Godel number of a formula weakly represent¬ 
ing the function g(b) = [[e]] 2 («, b). Hence it is an index of g. 

H 


We will utilize the parameter theorem to show that certain sets are 
not recursive. We already know that K — {a | |[a]|i(a) is defined} is 
not recursive. For a given nonrecursive set A we can sometimes find a 
(total) recursive function g such that 

a e K g(a) e A 

or a (total) recursive function g' such that 

a <£ K <s> g'(ci) e A. 

In either case it then follows at once that A cannot be recursive lest K be. 
In the former case we have K < m A and A is not Id | (by Lemma 361); in 
the latter case K < m A and A is not E |. In either case, A is not recursive. 
The function g or g' can often be obtained from the parameter theorem. 

Example, [a \ W a = 0} is not recursive. 

Proof. Call this set A. First, note that A e ni, since W a — 0 
iff 'ib'ik {a, b, Ic) £ T\. Consequently, K cannot be many-one 
reducible to A, but it is reasonable to hope that K might be. That 
is, we want a total recursive function g such that 


Ifl]]i(fl) is undefined <£> domUgffl)]]! = 0. 
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This will hold if for all b , |[g(fl)]h (b) — Ia]]i(a). So start with 
the recursive partial function 

f(a, b) = Efl]]i(a) 

and let g(a) = p(f , a), where / is an index for /. Then 

IT g(a)Mb) = lp(f,a)Mb) = f(a,b) = Mr(«). 

Thus this g shows that K is many-one reducible to A. H 

Theorem 36J (Rice, 1953) Let C be a set of one-place recursive par¬ 
tial functions. Then the set { e | UeJ! e C } of indices of members 
of C is recursive iff either C is empty or C contains all one-place 
recursive partial functions. 

Proof. Only one direction requires proof. Let Ic = {e | leji e C } 
be the set of indices of members of C. 


Case I: The empty function 0 is not in C. If nothing at all is 
in C we are done, but suppose some function if/ is in C. We can 
show that K is many-one reducible to Ic if we have a recursive 
total function g such that 


Eg(a)It = 


i// if a e K, 
0 if a £ K. 


For then a e K <$■ Ig(fl)]]i eC<s> g(a) e Ic- 

We can obtain g from the parameter theorem by defining 


g{a) = p{e, a). 


where 


lie] ] 2 (a, b) = 


i /r{b) if a e K, 
undefined if a £ K. 


The above is a recursive partial function, since 
Ie ]]2 (a, b) = c<&aeK&\lr(b)=c 


and the right-hand side is recursively enumerable. 

Case II: 0 e C. Then apply case I to the complement C of C. We 
can then conclude that I- ( j is not recursive. But 1^ is the complement 
of Ic, so Ic cannot be recursive. 

Thus in either case, Ic is not recursive, H 


Examples. For any fixed e, the set {a | W a = W ,,) is not recursive, as 
a consequence of Rice’s theorem. In particular, [a \ W a = 0} is 
not recursive, a result proved in an earlier example. For two other 
applications of Rice’s theorem, we can say that {a | W a is infinite} 
and {a \ W a is recursive} are not recursive. 
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Register Machines 

There are many equivalent definitions of the class of recursive func¬ 
tions. Several of these definitions employ idealized computing devices. 
These computing devices are like digital computers but are free of any 
limitation on memory space. The first definition of this type was pub¬ 
lished by Alan Turing in 1936; similar work was done by Emil Post at 
roughly the same time. We will give here a variation on this theme, due 
to Shepherdson and Sturgis (1963). 

A register machine will have a finite number of registers, numbered 
1,2 ,..., K. Each register is capable of storing a natural number of 
any magnitude. The operation of the machine will be determined by a 
program. A program is a finite sequence of instructions, drawn from the 
following list: 

I r (where 1 < r < K). “Increment r.” The effect of this instruction 
is to increase the contents of register r by 1. The machine then proceeds 
to the next instruction in the program. 

D r (where 1 < r < K). “Decrement r.” The effect of this instruction 
depends on the contents of register r. If that number is nonzero, it is 
decreased by 1 and the machine then proceeds, not to the next instruc¬ 
tion, but to the following one. But if the number in register r is zero, the 
machine just proceeds to the next instruction. In summary: The machine 
tries to decrement register r and skips an instruction if it is successful. 

T q (where q is an integer-positive, negative, or zero). “Transfer q.” 
All registers are left unchanged. The machine takes as its next instruc¬ 
tion the gth instruction following this one in the program (if q > 0), or 
the \q |th instruction preceding this one (if q < 0). The machine halts if 
there is no such instruction in the program. An instruction of T 0 results 
in a loop, with the machine executing this one instruction over and over 
again. 

Examples 

1. Program to clear register 7. 

Try to decrement 7. 

Go back and repeat. 

Halt. 

2. Program to move a number from register r to register s. 

Clear register ,v (Use the program of the first example.) 

D r Take 1 from r. 

T 3 | Halt when zero. 

I 5 Add 1 to s. 

T — 3 Repeat. 
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This program has seven instructions altogether. It leaves a zero 
in register r. 

3. Program to add register 1 to registers 2 and 3. 

[-D 1 
T 4 - 
I 2 
I 3 
-T -4 


4. (Addition) Say that a and b are in registers 1 and 2. We want 
a + b in register 3, and we want to leave a and b still in registers 1 
and 2 at the end. 


Clear register 3. 

Move number from register 1 to register 4. 
Add register 4 to registers 1 and 3. 

Move number from register 2 to register 4. 
Add register 4 to registers 2 and 3. 


Register contents 
a b 0 

0 b 0 a 

aba 0 

a 0 a b 

a b a + b 0 


This program has 27 instructions as it is written, but three of 
them are unnecessary. (In the fourth line we begin by clearing 
register 4, which is already clear.) At the end we have the number 
a back in register 1. But during the program register 1 must be 
cleared; this is the only way of determining the number a. 

5. (Subtraction) Let a — b = max (a — b, 0). We leave this 
program to the reader (Exercise 11). 


Now suppose / is an n-place partial function on N. Possibly there 
will be a program P such that if we start a register machine (having all 
the registers to which P refers) with a\,... ,a n in registers \n and 
apply program P, then the following conditions hold: 


(i) If/(fli, .... a,,) is defined, then the calculation eventually termi¬ 
nates with f(ai,...,a n ) in register n + 1. Furthermore, the calculation 
terminates by seeking a (p +1 )st instruction, where p is the length of P. 

(ii) If f(a\, ... ,a n ) is undefined, then the calculation never termi¬ 
nates. 


If there is such a program P, we say that P calculates f. 

Theorem 36K Let / be a partial function. Then there is a program 
that calculates / iff f is a recursive partial function. 

Thus by using register machines we arrive at exactly the class of 
recursive partial functions, a class we originally defined in terms of 
representability in consistent finitely axiomatizable theories. The fact 
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that such different approaches produce the same class of partial functions 
is evidence that this is a significant class. 

Outline of Proof. To show that the functions calculable by reg¬ 
ister machines are recursive partial functions, one “arithmetizes 
calculations” in the same spirit as we arithmetized deductions in 
Section 3.4. That is, one assigns Godel numbers to programs and 
to sequences of memory configurations. One then verifies that the 
relevant concepts, translated into numerical relations by the Godel 
numbering, are all recursive. (After going through this, one per¬ 
ceives that, from a sufficiently general point of view, deductions 
and calculations are really about the same sort of thing.) 

Conversely, to show that the recursive partial functions are cal¬ 
culable by register machines, one can work through Sections 3.3 
and 3.4 again, but where functions were previously shown to be 
representable in Cn Ag, they must now be shown to be calculable 
by register machines. This is not as hard as it might sound, since 
after the first few pages, the proofs are all the same as the ones 
used before. There is a reason for this similarity. It can be shown 
that the class of all recursive functions is generated from a certain 
handful of recursive functions by the operation of composition (in 
the sense of Theorem 33L) and the “least-zero” operator (Theo¬ 
rem 33M). Much of the work in Sections 3.3 and 3.4 amounts 
to a verification of this fact. Thus once one has shown that each 
function in this initial handful is calculable by a register machine 
and that the class of functions calculable by register machines 
is closed under composition and the least-zero operator, then the 
earlier work can be carried over, yielding the calculability of all 
recursive functions. H 


Exercises 

1. Define functions / and g by 


f(n) 


g(n) 


0 if Goldbach’s conjecture is true, 

1 otherwise; 

0 if in the decimal expansion of tc there 
is a run of at least n consecutive 7’s, 

1 otherwise. 


Is / recursive? Is g recursive? (Goldbach’s conjecture says that 
every even integer greater than 2 is the sum of two primes. The first 
edition of this book used Fermat’s last theorem here.) 
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2. Define the “diagonal” function d(a) — I«]] 1 (a) + 1. 

(a) Show that d is a recursive partial function. 

(b) By part (a), we have d — [eji for a certain number e. So on 
the one hand, d ( e) — fle]] j ( e ) and on the other hand d ( e ) = 
M i (e) +1 • Can we cancel, to conclude that 0 = l? Suggestion: 
Use the special symbol to mean that either both sides of 
the equation are undefined, or both sides are defined and equal. 
Restate the argument in this notation. 

3. (a) Show that the range of any recursive partial function is recur¬ 

sively enumerable. 

(b) Show that the range of a strictly increasing (i.e., f(n) < 
f{n + 1)) total recursive function / is recursive. 

(c) Show that the range of a nondecreasing (i.e., f(n ) < f(n + 1)) 
total recursive function / is recursive. 

4. (a) Let A be a nonempty recursively enumerable subset of N. Show 

that A is the range of some total recursive function. 

(b) Show that any infinite recursively enumerable subset of N 
includes an infinite recursive subset. 

5. Show that every recursive partial function has infinitely many 
indices. 

6. Give an example of a function / and a number e such that for 
all a, 

f(a ) = U(pik ( e , a, k) e T\) 

but e is not the Godel number of a formula weakly representing in 
Cn Ae- 

7. Show that the parameter theorem can be strengthened by requiring 
p to be one-to-one. 

8. Recall that the union of two recursively enumerable sets is recur¬ 
sively enumerable (Exercise 7 of Section 3.5). Show that there is a 
total recursive function g such that W g ( a ,b) = W„ U Wt- 

9. Show that {a \ W a has two or more members) is in Z but not in 11 j. 

10. Show that there is no recursively enumerable set A such that {!«]] i | 
a e A) equals the class of total recursive functions on N. 

11. Give register machine programs that calculate the following func¬ 
tions: 

(a) Subtraction, a — b — max (a — b, 0). 

(b) Multiplication, a ■ b. 

(c) ma x(a,b). 

12. Assume that there is a register machine program that calculates the 
77 -place partial function /. Show that given any positive 
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integers ..., r„ (all distinct), p, and k. we can find a program 
Q such that whenever we start a register machine (having all the 
registers to which Q refers) with ci\,... ,a n in registers r\,..., r„ 
and apply program Q, then (i) if f{ci \,..., a „) is defined, then the 
calculation eventually terminates with f (a \,..., a n ) in register p, 
with the contents of registers 1,2,... ,k (except for register p) the 
same as their initial contents, and furthermore the calculation ter¬ 
minates by seeking a (q + l)st instruction, where q is the length 
of Q; (ii) if f (a \, ..., a„) is undefined, then the calculation never 
terminates. 

13. Let g : N " +1 -> N be a (total) function that is calculated by some 
register machine program. Let f{a\,...,a„) — pb[g(a i,..., 
a n , b ) = 0 ], where the right-hand side is undefined if no such b 
exists. Show that the partial function / can be calculated by some 
register machine program. 

14. Show that the following sets have the given location in the arithmeti¬ 
cal hierarchy. (In each case, the given location is the best possible, 
but we will not prove that fact.) 

(a) {e | [[eji is total) is n 2 . 

(b) {e | W e is finite} is £ 2 . 

(c) {e | W e is cofinite} is £ 3 . 

(d) {e | W e is recursive) is £ 3 . 

15. Let Tot — {e \ HeJi is total). Clearly Tot C K. Show that there is 
no recursive set A with 

Tot C. A C. K. 

Remark'. This result includes Theorems 36E and 36F; the proofs 
used there can be adapted here. 

16. (a) Show that each Tl 2 set of natural numbers is, for some number 

e, the set 

{a | W>3c 73(A <L b, c)}. 

(b) Show that the set {<2 | not Wb3c T^ia, a, b, c)} is £2 but 
not n 2 . 

*(c) Generalize parts (a) and (b) to show that for each n, there is a 
set that is £„ but not n„ . 

17. Assume that A is a set of natural numbers that is arithmetical but 
is not n,„. Use the argument of page 256 to show that JJThfR 
is not £,„. 

Remark: Exercises 16 and 17 give a proof of Tarski’s theorem 
(that ft Th DT is not arithmetical) from computability theory. 
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SECTION 3.7 _ 

Second Incompleteness Theorem 

Let us return once again to item 20 in Section 3.4. Suppose that we 
have a recursively axiomatizable theory T given by a recursive set A of 
axioms (i.e., JJ A is recursive). Then as in item 20 

a € ttr ■<=> 3d [d is the number of a deduction from A 
and the last component of d is a and a 
is the Godel number of a sentence]. 

The set of pairs (a, d) meeting the condition in brackets is recursive; let 
7 T (v |, V 2 ) he a formula — chosen in some natural way — that numeral- 
wise represents that binary relation in Ae- 

For any sentence cr, we can express ‘Thor” by the sentence 
3 ib 7r(S :o 0, Vi ). Let us give that sentence a name; define 

Prb r a = 3 U27r(S fl<T 0, Vi). 

(Here Prb abbreviates “provable.” The subscript should perhaps be “A” 
instead of “7”'; in constructing the sentence we utilize the recursiveness 
of the set A of axioms.) 

Lemma 37A Let T be a recursively axiomatizable theory as above. 

(a) Whenever T h a then Ae h Prb j a. 

(b) If in addition T includes Ae, then T has the “reflection” 
property: 

T \- o => T h Prbj- a. 

Proof. If T h a then we can let d be the number of a deduction 
of cr from the axioms A for T. We have Ae h jrtS^ O. S^O), 
and hence Ae h Prbj- cr. This gives part (a), from which part (b) 
follows immediately. H 

Thus under modest assumptions, whenever T proves a sentence, it 
knows that it proves the sentence. Note that part (b) does not say that 
T h (cr — r Prb/ cr). For example, if a is true (in Tt) but improvable 
from Ae, then the sentence (a —*• Prl)^ o ) is not provable from Ae, 
and in fact is false in fR. 

Returning now to the proof of the Godel incompleteness theorem 
(in the self-reference approach), we can apply the fixed-point lemma to 
obtain a sentence o asserting its own unprovability in T : 

Ae h (cr <—»■ —1 Prbr cr). 

The following lemma provides part of the incompleteness theorem (the 
other part being Exercise 2 in Section 3.5): 
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Lemma 37B Let T be a recursively axiomatizable theory including 
Ae and let a be obtained from the bxed-point lemma as above. If 
T is consistent, then T Vf a. 

Proof 

T h a => T\~ Prby cr by reflection 
=> T I- 1 a by choice of a 

whence T is inconsistent. H 

So far, this lemma merely reflects ideas employed in Section 3.5, and 
the proof of Lemma 37B was not very complex. And that is exactly the 
point: The proof is not very complex, so perhaps it can be carried out 
within the theory T, if T is “sufficiently strong.” That, we can hope that 
the steps 

Prb 7 - a —Prby Prby o 
—► Prby —i a 
—> Prby 0 = SO 

can be carried out in a sufficiently strong extension T of Ay. 

If so, we get a remarkable conclusion. Let Cons T be the sentence 
—i Prby 0 = SO, which we think of as saying “T is consistent.” (Here 
0 = SO is chosen simply as a convenient sentence refutable from A E .) 
If T lets us carry out the steps in the preceding paragraph, then we can 
conclude: 

T I/ Cons T. unless T is inconsistent 

(Of course, an inconsistent theory contains every sentence, including 
sentences asserting — falsely — the theory’s consistency. The situation 
we are finding here is that, under suitable assumptions, this is the only 
way that a theory can prove its own consistency.) Let’s check the details: 

Suppose T h Cons T. Then by the preceding paragraph, 7’ I- 1 Prby a. 

By choice of cr, we then get T h a. Lemma 37B then applies. 

To make matters less vague, call the theory T sufficiently strong if it 
meets the following three “derivability” conditions. 

1. Ae c T. This implies by Lemma 37A that T has the reflection 
property, T h a =>■ T h Prby cr. 

2. For any sentence a, T h (Prby cr —* Prby Prby cr). This is the 
reflection property, formalized within T. 

3. For any sentences p and o, T F (Prby ( p —► cr) —► (Prby p —r 
Prby a)). This is modus ponens, formalized within T. 

Formalized Lemma 37B Assume that T is a sufficiently strong re¬ 
cursively axiomatizable theory, and let a be a sentence such that 

Ae F (cr <—y -i Prby cr). 

Then T h (Cons T —*■ —> Prby cr). 
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Proof. We put the pieces together carefully. By the choice of a 
we get 

T b (cr ->• (Prb r cr -»• 0 = SO)). 

Applying first reflection and then formalized modus ponens to 
this formula yields 

T h (Prbr a —> Prb, (Prb, cr ->• 0 = SO)) 

after which another application of formalized modus ponens 
yields 

T h (Prb r a — *■ (Prbr Prbr a —*■ —> Cons T)). 

The formula displayed above (to the right of the turnstile), to¬ 
gether with Prb/ a —»• Prb r Prb/ cr (formalized reflection) imply 
by sentential logic Prb j o —» —i Cons T. H 

Godel's Second Incompleteness Theorem (1931) AssumethatT is 
a sufficiently strong recursively axiomatizable theory. Then 
T h Cons T if and only if T is inconsistent. 

Proof. If T h Cons T then by Formalized Lemma 37B we have 

7’ I- 1 Prb/ a whence by our choice of cr, we have T h a. We 

conclude from the (unformalized) Lemma 37B that T is incon¬ 
sistent. H 

We can squeeze a bit more out of these ideas. Lemma 37B can be 
regarded as a special case (where r is 0 = SO) of the following: 

Lemma 37C Let T be a recursively axiomatizable theory including 
Ae, let r be a sentence, and let a be obtained from the fixed-point 
lemma so that 


Ae h (cr *-y (Prbr <r —* r)). 

If T h a , then T hr. 

Proof. We can think of a as saying, “If I am provable, then r.” If 
T h a then by reflection T h Prbr cr. By the choice of cr, we 
have rhr. H 

Actually we are not interested in this lemma, but in its formalization: 

Formalized Lemma 37C Assume that T is a sufficiently strong re¬ 
cursively axiomatizable theory. Let r be a sentence, and let a be 
a sentence such that 

Ae h (cr •*-» (Prbr cr —> r)). 


Then T h (Prbr cr —>- Prbr t). 
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Proof. We proceed as before. By the choice of cr we get 
T h (er —» (Prb 7 - o —> r)). 

Applying first reflection and then formalized modus ponens to 
this formula yields 

T I- (Prb r a —*■ Prbr (Prb r a —> r)) 

after which another application of formalized modus ponens yields 

T I- (Prbj- a —*■ (Prbr Prbr a —*■ Prbr r)). 

The formula displayed above (to the right of the turnstile), to¬ 
gether with Prb 7 a —>• Prb r Prbr a (formalized reflection) imply 
by sentential logic Prbj- cr —► Prbr r. H 

Lob's Theorem (1955) Assume that T is a sufficiently strong re¬ 
cursively axiomatizable theory. If r is any sentence for which 
T h (Prbr r —*■ r), then T hr. 

Clearly if T \- r, then T h (/> —> r) for any sentence p. So the 
conclusion to Lob’s theorem can be stated 

T h (Prbr r -> t) T hr. 

Proof. Given the sentence r, we construct a to say, “If I am prov¬ 
able then r,” as above. Suppose that T h (Prbr x —*■ x )• By 
Formalized Lemma 37C we have T h (Prbr a —* Prbr r). By 
our choice of a , we conclude that T h cr . So by the (unformalized) 
Lemma 37C, we have T hr. H 

Lob’s theorem was originally devised in order to solve the problem 
given in Exercise 1. But it implies (and in a sense is equivalent to) 
Godel’s second incompleteness theorem. Assume that 7’ is a sufficiently 
strong axiomatizable theory. Applying Lob’s theorem and taking r to 
be 0 = SO, we have 

T h (Prbr (0 = SO) —+ 0 = SO) 7b0 = SO, 

that is, 

T h Cons T => T is inconsistent. 

Thus we obtain a proof of the second incompleteness theorem. 

But there is an issue not yet examined: What theories are sufficiently 
strong? Are there any at all (apart from the trivial case of the inconsistent 
theory)? 

Yes, and here are two. The first is called “Peano arithmetic” (PA). 
Its axioms consist of the Ae axioms, plus all the “induction axioms.” 
These are the universal closures of formulas having the form 


^(0) A Vx((p{x) —* <p( Sx)) ->V.t (p{x) 
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for a wff tp. The induction axioms — which state the ordinary principle of 
mathematical induction — enable us to carry out many arguments about 
the natural numbers (e.g., the commutative law of addition) within Peano 
arithmetic. But to be sure that formalized reflection and formalized 
modus ponens can be derived with Peano arithmetic, one must carry out 
the details, which we will not go through here. 

We know that Peano arithmetic is consistent, because it is true in 91. 
But by the second incompleteness theorem, PA cannot prove its own 
consistency. We “know” that PA is consistent by means of an argument 
we carry out either in informal mathematics, or — if we want — in set 
theory. So set theory has a higher “consistency strength” than PA: It 
proves the consistency of PA and PA does not. 

A second sufficiently strong theory is axiomatic set theory. Or to be 
more careful, it is the set of sentences in the language of number theory 
that are provable in axiomatic set theory. The next subsection deals with 
this situation. This theory has the advantage that it is quite believable — 
on an informal level — that formalized reflection and formalized modus 
ponens are derivable. But what are our grounds for thinking that set 
theory is consistent? We know that PA is consistent because it is true in 
the “standard model” 91 of number theory. It is not at all clear that we 
can meaningfully speak of a “standard model of set theory”! 


Applications to Set Theory 

We know that in the language of number theory, Cn Ae is incomplete 
and nonrecursive, as is any compatible recursively axiomatizable theory 
in the language. 

But now suppose we leave arithmetic for a while and look at set the¬ 
ory. Here we have a language (with the parameters V and e) and a set 
of axioms. In all presently accepted cases the set of axioms is recursive. 
Or more precisely, the set of Godel numbers of the axioms is recursive. 
And so the theory (set theory) obtained is recursively enumerable. We 
claim that this theory, if consistent, is not recursive and hence not com¬ 
plete. We can already sketch the argument in rough form. We can, in a 
very real sense, embed the language of number theory in set theory. We 
can then look at that fragment of set theory which deals with the natu¬ 
ral numbers and their arithmetic (the shaded area in Fig. 14). That is a 
theory compatible with A K . And so it is nonrecursive. Now if set theory 
were recursive, then its arithmetical part would also be recursive, which 
it is not. As a bonus, we will come across the second incompleteness 
theorem for the case of set theory. 

Henceforth by set theory (ST) we mean that theory (in the language 
with equality having the two parameters V and e) which is the set of con¬ 
sequences of the reader’s favorite set-theoretic axioms. (The standard 
Zermelo-Fraenkel axioms will do nicely, if the reader has no favorite. 
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Figure 14. Set theory and number theory, (a) Flat picture, (b) A more accurate 
picture. 


We ask only that the set of axioms be recursive, and that it be strong 
enough to yield certain everyday facts about sets.) We need an inter¬ 
pretation it of Cn Ae into ST. (The remainder of this section assumes a 
familiarity with Section 2.7.) But the existence of such a it is a standard 
result of set theory, although it is not usually stated in these words. We 
need formulas of the language of ST that adequately express the concept 
of being a natural number, being the sum of two given numbers, and so 
forth. To find these formulas, we turn to the way in which the arithmetic 
of natural numbers can be “embedded” in set theory. That is, on the one 
hand, natural numbers such as 2 or 7 do not appear to be sets. On the 
other hand, we can, when we choose, select sets to represent numbers. 
The standard approach is to take 0 to be the set 0 and n + 1 to be the 
set n; n. This has the fringe benefit that each number is the set of all 
smaller numbers (e.g., 3 e 7). Let a> be the collection of all these sets 
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(these “number-sets”); thus co is the set representing N. 

The formula 7Tv is the result of eliminating the defined symbol co 
from the formula V\ e co. The formula jto is similarly obtained from 
the set-theoretic formula V\ — 0, and the formula itg is obtained from 
V2 — v\ U {ui}. The formula 7T< is simply v\ e Vo. For n + we use the 
translation into the language of ST of 

For any /, if / : o> x co —» co and for all a and b 
in co we have f (a, 0) = a 
and f(a, b U {/?}) = f(a, b) U {f(a, b)}, 
then f(v i, V 2 ) = 173 . 

(The manner of translation is partially indicated in Chapter 0.) The 
formulas 7r and 7 Te are obtained in much the same fashion. 

The claim that this n is an interpretation of Cn A E into ST makes a 
number (and the number is 17) of demands on ST. 

(i) 3 V\ix\/ must be in ST. It is, since we can prove in set theory that 
co is nonempty. 

(ii) For each of the five function symbols f in the language of 
Ae, ST must contain a sentence asserting, roughly, that 7 rf defines a 
function on the set defined by n\/. (The exact sentence is set forth in 
the definition of interpretation in Section 2.7.) In the case of 0, we have 
in ST the result that there is a unique empty set and that it belongs to 
co. The case for S is simple, since its defines a unary operation on the 
universe of all sets, and co is closed under this operation. For + we 
must use the recursion theorem on co. That is, we can prove in ST (as 
sketched in Section 1.4) that there is a unique / : co x co —> 00 such that 
f(a, 0) = a and f (a, b U {/?}) = f{a, b) U {/(a, b)} for a, b in co. 
The required property of 7r + then follows. Similar arguments apply to • 
and E. 

(iii) For each of the 11 sentences a in A E , the sentence must be 
in ST. For example, in the case of L3, we have in ST the fact that for 
any m and n in co, either m e n, m — n, or n e m. 

Since these demands are finite in number, there is a finite $ C ST 
such that 7 r is also an interpretation of Cn A E into Cn <t>. 

Theorem 37D (Strong Undecidability of Set Theory) Let T be a 

theory in the language of set theory such that T U ST (or at least 
T U O) is consistent. Then is not recursive. 

Proof. Let A be the consistent theory Cn(7’ U <t>). Let Ao be the 
corresponding theory 7T~' [ A] in the language of number theory. 
From Section 2.7 we know that Ao is a consistent theory (since 
A is). Also A e c Ao, since if a e A E , then u” e Cn$ c A. 
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Hence by the strong undecidability of Cn A E (Theorem 35C), f A () 
is not recursive. 

Now we must derive the nonrecursiveness of T from that of 
Aq. We have 


a e A 0 iff a n e A 

and by the lemma below, $cr 7r depends recursively on flcr. That is, 
JJAo < m tJ A. Hence (J A cannot be recursive, lest JJAo be. Similarly, 
we have 


r e A iff (<p —> r) e T, 

where tp is the conjunction of the members of <t>. Since ft (<p —»• r) 
depends recursively on JJr, we have JJ A < m tf T so that ft T cannot 
be recursive lest flA be. H 

L emma 3 7 E There is a recursive function p such that for any formula 
a of the language of number theory, p{ JJa) = JJ (a 71 ). 

Proof. In Section 2.7 we gave explicit instructions for construct¬ 
ing a n . The construction in some cases utilized formulas for 
formulas fi simpler than a. The methods of Sections 3.3 and 3.4 
can be applied to the Godel numbers of these formulas to show 
that p is recursive. But the details are not particularly attractive, 
and we omit them here. H 

Corollary 37F If set theory is consistent, then it is not complete. 

Proof. Set theory has a recursive set of axioms. If complete, the 
theory is then recursive (by item 21 of Section 3.4). By the fore¬ 
going theorem, this cannot happen if ST is consistent. H 

Corollary 37G In the language with equality and a two-place 
predicate symbol, the set of (Godel numbers of) valid sentences 
is not recursive. 

Partial Proof. In the foregoing theorem take T — Cn0, the set of 
valid sentences. The theorem then assures us that J \T is nonrecur¬ 
sive, provided that <I> is consistent. We have not given the finite 
set <I> explicitly. But we assure the reader that O can be chosen in 
such a way as to be provably consistent. H 

It should be noted that n is not an interpretation of Th Ti into ST 
(unless ST is inconsistent). For zr 1 [ST] is a recursively enumerable 
theory in the language of Tt, as a consequence of Lemma 37E. Hence 
it cannot coincide with Th Tt, and it can include the complete theory 
Th Tt only if it is inconsistent. 



274 


A Mathematical Introduction to Logic 


Godel's Second Incompleteness Theorem 
for Set Theory 

We can employ our usual tricks to find a sentence a of number theory 
which indirectly asserts that its own interpretation a n is not a theorem 
of set theory. For let D be the ternary relation on N such that 

(a. b, c) e D iff a is the Godel number of a formula a of number 
theory and c is the Godel number of a deduction 
from the axioms of ST of «(S , '0) 7r . 

The relation D is recursive (by the usual arguments); let S(v\, v 2 , ) 

represent D in Cn A E . Let r be the Godel number of 

V vo, — 1 5(ui, v\, V 2 ) 

and let a be 


V Vi — 1 5 (S' 0, S r 0, Vi). 

Observe that a does indirectly assert that a 71 £ ST. We will now prove 
that the assertion is correct: 

Lemma 37H If ST is consistent, then a n £ ST. 

Proof. Suppose to the contrary that a w is deducible from the ax¬ 
ioms of ST; let k be Q of such a deduction. Then (r, r, k) e D. 

A e h5(S' 0,S r 0, S A 0); 

.-. A e I- 3 U3<$(S r 0, S' O, V3); 


Applying our interpretation n, we conclude that —i o 11 is in ST, 
whence ST is inconsistent. Thus 

ST is consistent =>■ a n ^ ST. -\ 

Now the above proof, like all those in this book, is carried out in 
informal mathematics. But all of our work in the book could have been 
carried out within ST. Indeed it is common knowledge that essentially 
all work in mathematics can be carried out in ST. Imagine actually doing 
so. Then instead of a proof of an English sentence, “ST is consistent 
=>■ a n ST,” we have a deduction from the axioms of ST of a certain 
sentence in the formal language of set theory: 

(Cons(ST) —> □). 

Here Cons(ST) is the result of translating (in a nice way) “ST is 
consistent” into the language of set theory. Similarly, □ is the result of 
translating “er 71 ^ ST.” But we already have a sentence in the language 
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of set theory asserting that a 77 ^ ST. It is a 71 . This strongly suggests 
that □ is (or is provably equivalent in ST to) a n , from which we get 

(Cons(ST) —> a 71 ) 

as a theorem of ST. 

Now this can actually be carried out in such a way as to have □ be 
o 71 . We have given above an argument, which we hope will convince 
the reader that this is at least probable. And from it we now have the 
result: 

Godel's Second Incompleteness Theorem for Set Theory The sen¬ 
tence Cons (ST) is not a theorem of ST, unless ST is inconsistent. 

Proof. By the above (plausibility) argument 
(Cons(ST) —> a 71 ) 

is a theorem of ST. So if Cons(ST) is also a theorem of ST, then a 77 
is, too. But by Lemma 37H, if a 71 e ST, then ST is inconsistent. 

H 

Of course if ST is inconsistent, then every sentence is a theorem, 
including Cons(ST). Because of this, a proof of Cons(ST) within ST 
would not convince people that ST was consistent. (And by Godel’s 
second theorem, it would convince them of the opposite.) But prior to 
Godel’s work it was possible to hope that Cons(ST) might be provable 
from assumptions weaker than the axioms of set theory, ideally assump¬ 
tions already known to be consistent. But we now see that Cons (ST) is 
not in any subtheory of ST, unless of course ST is inconsistent. 

We are left with the conclusion that any recursively axiomatizable 
theory of sets (provided it meets the desirable conditions of being 
consistent and strong enough to prove everyday facts) is an incom¬ 
plete theory. This raises a challenge: to find additional axioms to add 
to the theory. On the one hand, we want the additional axioms to 
strengthen the theory in useful ways. On the other hand, we want the 
additional axioms to reflect accurately our informal ideas about what 
sets really are and how they really behave. 


Exercises 

1. Let a be a sentence such that 

A e L (ct <-» Prb Aj! CT). 

(Thus cr says “I am provable,” in contrast to the sentence “I am 
unprovable” that has been found to have such interesting properties.) 
Does A e h cr? 
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2. Let T be a theory in a recursively numbered language, and assume 
that there is an interpretation of Cn A E into T. Show that T is strongly 
undecidable; i.e., whenever T is a theory in the language for which 
T U T is consistent, then ft T is not recursive. 

SECTION 3.8 _ 

Representing Exponentiation 1 

In Sections 3.1 and 3.2 we studied the theory of certain reducts of 91 
and found them to be decidable. Then in Section 3.3 we added both 
multiplication and exponentiation. The resulting theory was found (in 
Section 3.5) to be undecidable. Actually it would have been enough 
to add only multiplication (and forego exponentiation); we would still 
have undecidability. 

Let 9 Tm be the reduct of 91 obtained by dropping exponentiation: 

91 M = (N; 0, S , <, +, •)• 

Thus the symbol E does not appear in the language of 91m- Let Am be 
the set obtained from A E by dropping El and E2. The purpose of this 
section is to show that all the theorems of Sections 3.3-3.5 continue to 
hold when “A E ” and “91” are replaced by “Am” and “91m” The key fact 
needed to establish this claim is that exponentiation is representable in 
Cn Am- That is, there is a formula s in the language of 91m such that 
for any a and b, 

A m h Vz[e(S n 0, S b 0, z) z = S (fli) 0]. 

Thus s(x, y, z) can be used to simulate the formula xEy = z without 
actual use of the symbol E. 

If we look to see what relations and functions are representable in 
Cn Am. we find at first that everything (except for exponentiation itself) 
that was shown to be representable in Cn A E is (by the same proof) 
representable in Cn A M - Until, that is, we reach item 7 in the catalog 
listing of Section 3.3. To go further, we must show that exponentiation 
itself is representable in Cn Am- 

We know that exponentiation can be characterized by the recursion 
equations 

a 0 = 1, 


1 This section may be omitted without loss of continuity. 
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From what we know about primitive recursion (catalog item 13 in 
Section 3.3 plus Exercise 8 there), we might think of defining 

E*(a, b ) = the least s such that [(5)0 = 1 and 
for all i < b , (s ) i+ 1 = ( s) t ■ a]. 

For then a b = (E*(a, b))i,. This fails to yield a proof of representabil- 
ity, because we do not yet know that the decomposition function ( a)b 
is representable in CnA^. But we do not really need that particular 
decomposition function (which corresponded to a particular way of 
encoding sequences). All we need is some function <5 that acts like a 
decomposition function; the properties we need are summarized in the 
following lemma. 

Lemma 38A There is a function 8 representable in Cn Am such that 
for every n, ag, . .., a n , there is an s for which 8(s, i) = fl; for 
all i < n. 

Once the lemma has been established, we can define 

E**(a , b) — the least s such that [<5(s, 0) = 1 and 

for all i < b, 8(s , i + 1) = 8(s , i) ■ a]. 

The lemma assures us that such an s exists. £** is then representable in 
Cn Am, as is exponentiation, since 

a b = 8{E**{a, b), b). 

A function <5 that establishes the lemma will be provided by some facts 
of number theory. 

A Pairing Function 

As a first step toward proving the foregoing lemma, we will construct 
functions for encoding and decoding pairs of numbers. It is well known 
that there exist functions mapping N x N one-to-one onto N. In partic¬ 
ular, the function J does this, where in the diagram shown, J(a, b) has 
been written at the point with coordinates (a, b). 


6 — 

l\l 

3 — 7 — 

l\l\l 

1—4 — 8—- 

[\l\l\l 

0 — 2 — 5 — 9 — 

X X X X 

—/!-■ ,Jj tA 

^ V ^o> 
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For example, 7(2, 1) = 8 and 7(0, 2) = 3. To obtain an equation 
for 7 (a, b), we note that along the line x + y — n there are n + 1 points 
(with coordinates in N). Thus 

J {a. b) = the number of points in the plane to which 7 assigns smaller 
values 

= [the number of points on lines x + y — n for 

n = 0, 1,..., (a + b — 1)] + [the number of points on the 
line x + y = a + b for which x < a] 

— [1 + 2 + • • • + (a + (?)] + ci 

= ^ (q -(- b) (o T- b T - 1) “b ci 
= \[(a + b) 2 + 3a + b]. 

Let K and L be the corresponding projection functions onto the axes, 
i.e., the unique functions such that 

K(J(a,b)) — a, L(J(a.b)) — b. 

For example, K (7) = 1, the x -coordinate of the point (1, 2} in the plane 
to which 7 assigned the number 7. Similarly, L (7) = 2, the y-coordinate 
of that point. 

We claim that 7, K, and L are representable in Cn A M . The function 

H(a) — the least b such that a < 2b 

has the property that H(a) — \a for even a. Then we can write 

7(a, b ) = H((a + b) ■ (a + b + 1)) + a, 

K(p ) = the least a such that [for some b < p, J {a , b) = p], 
L(p) — the least b such that [for some a < p, J (a , b) — p]. 

From the form of the four preceding equations we conclude that 77, 7, 
K, and L are representable in Cn Am- 

The Godel /3-function 

Let p be the function defined as follows: 

/3(c, d,i) — the remainder in c H- [1 + O’ + 1) • d] 

— the least r such that for some q < c. 
c — q ■ [1 + (i + 1) • 7] + r. 

This unlikely-looking function produces a satisfactory decomposition 
function for Lemma 38A. Let 

<50?, i) = /3(K(s ), L(s), i). 

It is clear that 8 is representable in Cn Am- What is not so obvious is 
that it meets the conditions of Lemma 38A. We want to show: 

For any n and any oq, ... ,a„, there are numbers 
c and d such that /l(c, d,i) — a\ for all i < n. 

For then it follows that S(J(c, d), i) — /3(c, d , i) = a, for i < n. 
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Now (*) is a statement of number theory, not logic. The proof of (*) 
is based on the Chinese remainder theorem. Numbers do,... ,d n are 
said to be relatively prime in pairs iff no prime divides both d, and d ; 

for i ± j. 

Chinese Remainder Theorem Let do,, d„, be relatively prime in 
pairs; let ao, ..., a„ be natural numbers with each a, < dj. Then 
we can find a number c such that for all i < n, 

at — the remainder in c d,. 

Proof. Let p — n,< f ,d,, and for any c let F(c) be the (« + l)-tuple 
of remainders when c is divided by do, ..., d„. Notice that there 
are p possible values for this (n + 1)-tuple. 

We claim that F is one-to-one on (A: | 0 < k < p\. For suppose 
that F(c i) = F(c 2 ). Then each dj divides |ci — C 2 I - Since the d, ’s 
are relatively prime, p must divide |ci — C 2 I - For c 1 , ci less than 
p, this implies that c 1 = 02 - 

Hence the restriction of F to \k | 0 < k < p \ takes on all p 
possible values. In particular, it assumes (at some point c) the 
value (ao,..., a„). And that is the c we want. H 

Lemma 38B For any s > 0, the .v + 1 numbers 

1+1-s!, 1+2-s!, ..., l + (s+l)-sl 
are relatively prime in pairs. 

Proof. All these numbers have the property that any prime factor 
q cannot divide 5 !, whence q > .y. If the prime q divides both 
1 + j ■ s ! and 1 + k ■ s !, then it divides their difference, | j — k\ ■ s !. 
Since q does not divide s !, it divides | j — k\. But | j — k\ < s < q. 
This is possible only if \j — k\ = 0. H 

Proof of (*). Assume we are given a 0 , ..., a n \ we need numbers 
c and d such that the remainder when c is divided by 1 + (i + 1) • d 
is ai, for i < n. 

Let .v be the largest of {«, ao,_> ««} an d let d = s !. Then by 

Lemma 38B, the numbers 1 + (i + 1) • d are relatively prime in 
pairs for i <n. So by the Chinese remainder theorem there is a c 
such that the remainder in c [1 + (i + 1) • d] is a,- for i < n. 

H 


This completes the proof of Lemma 3 8A. And by the argument that 
followed that lemma, we can conclude: 

Theorem 38C Exponentiation is representable in Cn Am- 

Armed with this theorem, we can now return to catalog item 7 of 
Section 3.3. The proof given there now establishes that the function in 
question (whose value at n is p n ) is representable in Cn Am - For it was 
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TABLE X 


Structure 

Theory 

Models of the theory 

Definable sets 

Comments 

(N) 

Decidable. Not finitely 
axiomatizable. Admits 
elimination of quantifiers. 

Any infinite set. 

0 and N. 

{0} is not definable. 


(N; 0) 

As above. 

Any infinite set with 
distinguished element. 

0, {0}, N — {0}, N. 

S is not definable. 


(N; 0, S) 

As above. 

Standard part plus any 
number of Z-chains. 

Finite and cofinite sets. 

< is not definable. 

{0} is definable in (N; S). 

(N; 0, 5, <) 

Decidable. Finitely 
axiomatizable. 

Admits elimination of 
quantifiers. 

As above, with any 
ordering of the Z-chains. 

Finite and cofinite sets. 

+ is not definable. 

{0} and 5 are definable 
in (N; <). 

(N; 0, S, <, +) 

Decidable (Presburger). 

The Z-chains are densely 
ordered without 
endpoints. 

Also there is a suitable 
addition operation. 

Eventually periodic sets. 

■ is not definable. 

{0}, 5, and < are 
definable in (N; +). 

(N; 0, 5, <,+,■) 

Not arithmetical. 

.". not recursively 
axiomatizable. 

As above, but with a 
suitable multiplication 
operation. 

All arithmetical relations 

are definable. 

The arithmetical relations 

are definable in 

(N; 5, •), (N; +, •), 
and (N; <, D ), where 
D(x, y) = ( x) y . 
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formed by allowable methods from relations and functions (including 
exponentiation) known to be representable in Cn Am- 

The same phenomenon persists throughout Sections 3.3 and 3.4. 
The representability proofs given there now establish representability 
in Cn Am- Thus any recursive relation is representable in Cn Am, and if 
the relation happens to be a function, then it is functionally representable. 
The proofs given in Section 3.5 then apply to *31 m and ,4 M as well as to 
*Tt and Ay. In particular, we have the strong undecidability of Cn 4 W : 
Any theory T in the language of 91 m for which 7’ U 4 W is consistent 
cannot be recursive. 

Notice that any relation definable in 91 (i.e., any arithmetical relation) 
is also definable in 91 m • For exponentiation, being representable in a 
subtheory of T1 i9Im, is a fortiori definable in 91 m- By the new version 
of Tarski’s theorem, TfhhLv/ is not definable in 91 m, and consequently 
;Th91v/ cannot be arithmetical. 

In the terminology of Section 2.7, we can say that there is a faithful 
interpretation of Th 91 into Th 91 m ■ It equals the identity interpretation 
on all parameters except E, and to E it assigns a formula defining ex¬ 
ponentiation in 91m ■ 

In Table X we summarize some of the results of Chapter 3 on number 
theory and its reducts. 


Exercises 

1. Let D(a, b) — ( a)b■ Show that any arithmetical relation is definable 
in the structure (N; <, D). Remark: One may well ask why T1 i91a, 
arithmetic with addition, is decidable (as shown in Section 3.2), while 
T1 i91m, the theory of arithmetic with addition and multiplication, is 
undecidable. One answer is that, as this section shows, multiplication 
lets us do a certain amount of sequence coding and decoding. The 
point of this exercise is to show that once we have the decoding 
function D and ordering, we have the full complexity of arithmetic 
with addition, multiplication, and exponentiation. 

2. Show that the addition relation {{ a,b,c) \ a + b = c} is definable in 
the structure (N; 5, •)• Suggestion: Under what conditions does the 
equation S(ac ) • S(bc ) = S(c ■ c ■ S(ab)) hold? 

3. (a) Show that Th(Z; +, •) is strongly undecidable. (See Exercise 2 

of Section 3.7.) 

(b) (This part assumes a background in algebra.) Show that the the¬ 
ory of rings is undecidable and that the theory of commutative 
rings is undecidable. 
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SECTION 4.1_ 

Second-Order Languages 

We can obtain — at a cost — richer, more expres¬ 
sive languages than the first-order languages con¬ 
sidered thus far, by allowing quantification of 
predicate or function symbols. For example, 

3x(Px —> V.i Px) 

is a valid formula having V and P as its parameters. 
Because it is true no matter how P is interpreted, 

V P 3 x(Px Px) 

deserves to be called valid. (Now V is the only pa¬ 
rameter, since P is treated as a predicate variable.) 

Suppose, then, that we have in addition to the 
symbols introduced at the beginning of Section 2.1, 
the further logical symbols: 

4. Predicate variables: For each positive integer 
n we have the 77 -place predicate variables 

Xl,X n 2 ,.... 

5. Function variables: For each positive integer 
7z, we have the 77 -place function variables 

F'l, F 2, .... 

The usual variables V\, V2, ... will now be called 
individual variables, to avoid confusion. The terms 
are as before defined as the expressions that can be 
built up from the constant symbols and the 
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individual variables by applying the function symbols (both the func¬ 
tion parameters and the function variables). Atomic formulas are again 
expressions Pt\ ■ ■ ■ t„, where t\,... ,t n are terms and P is an n-place 
predicate symbol (parameter or variable). The definition of wff is aug¬ 
mented by new formula-building operations: If cp is a wff, then so also 
are VA - " q> and VF" q>. The concept of a variable occurring free in (p 
is defined just as before. A sentence is a wff er in which no variable 
(individual, predicate, or function) occurs free. 

It should be remarked that the roles played by predicate parame¬ 
ters and free predicate variables are essentially the same. There is the 
same close relationship between constant symbols and free individual 
variables, and between function parameters and free function variables. 

By a structure we continue to mean a function on the set of parame¬ 
ters meeting the conditions set forth in Section 2.2. We must extend the 
definition of satisfaction in the natural way. Let V now be the set of all 
variables, individual, predicate, or function. Let s be a function on V that 
assigns to each variable the suitable type of object. Thus s(vi) is a mem¬ 
ber of the universe, s(X" ) is an n -ary relation on the universe, and s(F ") 
is an n-ary operation. For a term t,s(t ) is defined in the natural way. In 
particular, if F is a function variable, then s(Ft\ ■■■/,,) is the result of 
applying the function s(F ) to (s(L), ..., s(t„)). Satisfaction of atomic 
formulas is also defined essentially as before. For a predicate variable X, 

\=<n Xti ■ ■ ■ t n [s] iff {s(h),... ,s(t„)) es(X). 

The only new features in the definition of satisfaction arise from our 
new quantifiers. 

5. |= 2 | \fX n j q>[s] iff for every n -ary relation R on |2l|, we have 

N* vW I R)l 

6. |= 2 i VF;>W iff for every function / : |2l|" —»■ |2l|, we have 

N* <p[s(F’l | /)]. 

Again it is easy to see that only the values of s at variables occurring 
free in the formula are significant. For a sentence a, we may unam¬ 
biguously speak of its being true or false in 21. Logical (semantical) 
implication is defined exactly as before. 

Example 1 . A well-ordering is an ordering relation such that any 
nonempty set has a least (with respect to the ordering) element. 
This last condition can be translated into the second-order 
sentence 

VX( 3 yXy — 3 y(Xy A V z(Xz -► y < z»). 

Here, as elsewhere, we omit the subscripts on X and F when they 
are immaterial, and we omit the superscripts if they are clear from 
the context. 
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Example 2. One of Peano’s postulates (the induction postulate) 
states that any set of natural numbers that contains 0 and is closed 
under the successor function is, in fact, the set of all natural num¬ 
bers. This can be translated into the second-order language for 
number theory as 

VX(X0 A V y(Xy -> XSy) —> V>> Xy). 

Any model of S1, S2, and the above Peano induction postu¬ 
late is isomorphic to (N; 0, .S'); see Exercise 1. Thus this set of 
sentences is categorical; i.e., all its models are isomorphic. 

Example 3. For any formula <p in which the predicate variable X" 
does not occur free, the formula 


3X n Vvi---Vv n [X n vi---v„ *-+</>] 


is valid. (Here other variables may occur free in <p in addition to 
Vi ,..., v n .) It says that there exists a relation consisting of exactly 
the n-tuples satisfying <p. Formulas of this form are called relation 
comprehension formulas. There are also the analogous function 
comprehension formulas . If \f is a formula in which the variable 
F" does not occur free, then 

V i?i • ■ ■ V v„ 3\v n+ \ if —>• 

3F n V ■ • • V v n+ i (F'vi ■■■v n — v n+ i ^ f) 

is valid. (Here “3!u „ +1 i//” is an abbreviation for a formula ob¬ 
tained from Exercise 21 of Section 2.2.) 

Example 4. In the ordered field of real numbers, any bounded non¬ 
empty set has a least upper bound. We can translate this by the 
second-order sentence 

VX[3yVz(Xz^z<y)/\3zXz^ 

3 y\f y'(y z(Xz —> z < /) •*->}’< /)]. 

It is known that any ordered field that satisfies this second-order 
sentence is isomorphic to the ordered field of reals. 

Example 5. For each n > 2, we have a first-order sentence which 
translates, “There are at least n things.” For example, M is 

3 x 3 y 3 z(x ^ y A x ^ z A y ^ z). 

The set {X 2 , A. 3 ,...} has for its class of models the EC a class 
consisting of the infinite structures. There is a single second- 
order sentence that is equivalent. A set is infinite iff there is an 
ordering on it having no last element. Or more simply, a set is 
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infinite iff there is a transitive irreflexive relation R on the set 
whose domain is the entire set. This condition can be translated 
into a second-order sentence /.^: 

3T([V mVvV w{Xuv —»•Xvw —> Xuw) A V u —i Xuu A 
Vm 3 v Xu v]. 

Another sentence (using a function variable) that defines the class 
of infinite structures is 

3 F[V x V y(Fx — Fy —y x = y) A 3 z V x Fx ^ z ], 
which says there is a one-to-one function that is not onto. 

The preceding example shows that the compactness theorem fails for 
second-order logic: 

Theorem 41A There is an unsatisfiable set of second-order sen¬ 
tences every finite subset of which is satisfiable. 

Proof. The set is, in the notation of the above example, 

{ 1 ^-oo? ^ 2 , A-3, . . .}. _j 

The Lowenheim-Skolem theorem also fails for second-order logic. 
By the language of equality we mean the language (with —) having no 
parameters other than V. A structure for this language can be viewed 
as being simply a nonempty set. In particular, a structure is determined 
to within isomorphism by its cardinality. A sentence in this language is 
therefore determined to within logical equivalence by the set of cardi¬ 
nalities of its models (called its spectrum ). 

Theorem 41 B There is a sentence in the second-order language of 
equality that is true in a set iff its cardinality is 2 K °. 

Proof, Using Concepts from Algebra and Analysis. Consider first 
the conjunction of the (first-order) axioms for an ordered field, 
further conjoined with the second-order sentence expressing the 
least-upper-bound property (see Example 4 of this section). This 
is a sentence whose models are exactly the isomorphs of the real 
ordered field (i.e., the structures isomorphic to the ordered field 
of real numbers). We now convert the parameters 0, 1, +, •, < to 
variables (individual, function, or predicate as appropriate) which 
we existentially quantify. The resulting sentence has the desired 
properties. 3 

There are other cardinal numbers with second-order characteriza¬ 
tions of this sort; cf. Exercise 2. 
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Theorem 41C The set of Godel numbers of valid second-order sen¬ 
tences is not definable in 02 by any second-order formula. 

Here we assume that Godel numbers have been assigned to second- 
order expressions in a manner like that used before. Although our proof 
applies to the second-order language of number theory, the theorem is 
true for any recursively numbered language having at least a two-place 
predicate symbol. 

Proof. Let T 2 be the second-order theory of 02, i.e., the set of 
second-order sentences true in 02. The same argument used to 
prove Tarski’s theorem shows that CT 1 is not definable in 02 by 
any second-order formula. 

Now let a be the conjunction of the members of Ae with the 
second-order Peano induction postulate (Example 2). Any model 
of a is isomorphic to 02; cf. Exercise 1. Consequently, for any 
sentence a, 

a e T 2 iff (a — *■ a) is valid. 

Consequently, the set of (Godel numbers of) validities cannot be 
definable lest $T 2 be. H 

A fortiori, the set of Godel numbers of second-order validities is 
not arithmetical and not recursively enumerable. That is, the enumer¬ 
ability theorem fails for second-order logic. (In the other direction, 
one can show that this set is not definable in number theory of or¬ 
der three, or even of order to. But these are topics we will not enter into 
here.) 

It is interesting to compare the effect of a second-order universal 
sentence, such as the Peano induction postulate 

VZ(X0 A V y(Xy -> XSy) -*• VyXy ) 

and the corresponding first-order “schema,” i.e., the set of all sentences 

0(0) A V y(<p(y) —> <p(S v)) —► Vy <p(y), 

where <p is a first-order formula having just v t free. If 21 is a model of 
the Peano induction postulate, then any subset of |2l| containing O'' 1 and 
closed under S' 1 is in fact all of 1211. On the other hand, if 21 is a model of 
the corresponding axiom schema, we can say only that every definable 
subset of 1211 containing O' 21 and closed under S sl is all of |2l|. There 
may well be undefinable subsets for which this fails. (For example, 
take any model 21 of Th(M: 0, S ) having Z-chains. Then 21 satisfies 
the above first-order schema, but it does not satisfy the second-order 
induction postulate. The set of standard points is simply not definable 
in 02.) 



Chapter 4: Second-Order Logic 


287 


Exercises 

1. Show that any structure for the language with parameters V, 0, and 
S that satisfies the sentences 

V.rS.r^O (SI) 

VZ Vy(Sx = Sv -*■ x = y) (S2) 

and the Peano induction postulate 

VZ(Z0 A V y(Xy XSy) V v Xy) 
is isomorphic to tTlj = (N; 0, S). 

2. (a) Give a sentence in the second-order language of equality that is 

true in a set iff its cardinality is Kq. 

(b) Do the same for N |. 

3. Let ip be a formula in which only the n -place predicate variable X 
occurs free. Say that an n-ary relation R on |2l| is implicitly defined 
in 21 by <p iff 21 satisfies ip with an assignment of R to X but does 
not satisfy <p with an assignment of any other relation to X. Show 
that tt Th (ft, the set of Godel numbers of first-order sentences true 
in hi. is implicitly definable in hi by a formula without quantified 
predicate or function variables. Suggestion: The idea is to write down 
conditions that the set of true sentences must meet. 

4. Consider a language (with equality) having the one-place predicate 
symbols I and S and the two-place predicate symbol E. Find a 
second-order sentence a such that (i) if A is a set for which AnVA — 
0 and if |2t| = AUPA,/ a = A,S a = VA,E % = {( a,b) \ a e b <z 
A}, then 21 is a model of er; and (ii) every model of a is isomorphic 
to one of the sort described in (i). Remark: Roughly speaking, a 
translates “S — VI .” 


SECTION 4.2 _ 

Skolem Functions 

We want to show how, given any first -order formula, one can find 
a logically equivalent prenex second-order formula of a very special 
form: 


existential quantifiers 


universal individual 
quantifiers 


quantifier-free formula 
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This is a prenex formula wherein all universal quantifiers are indi¬ 
vidual ones that follow a string of existential individual and function 
quantifiers. 

In the simplest example, observe that 

Vx 3 y (p(x, y) |==| 3FVx cp(x, Fx). 

In the “=|” direction this is easy to see. For the “|=” direction, consider 
a structure 21 and an assignment function s satisfying V.v 3 y <p(x,y). 
We know that for any a e |2t| there is at least one b e |2l| such that 

Na y)[s(* I a)(y \ b)]. 

We obtain a function / on |2l| by choosing one such b for each a and 
taking f(a) = b. (The axiom of choice is used here.) Then 

\= m Vx<p(x,Fx)[s(F\ /)]. 

This function / is called a Skolem function for the formula V.v 3 y (p in 
the structure 21. 

The same argument applies more generally. As a second example, 
suppose that we begin with the formula 

3 vi Vxi 3 y 2 Vx 2 Vx 3 3 y 3 i jf{y\ , y 2 , ^ 3 )- 

(We have listed only y \, y 2 , and y 3 , but possibly other variables occur 
free in x[r as well.) Here we already have the existential quantifier 3 yi 
at the left. What remains is 

Vxi 3 y 2 Vx 2 Vx 3 3 y 3 t/f(yi, y 2 , y 3 ). 

This is a special case of the first example (with cp(x i, y 2 ) = Vx 2 VX 3 
3 y 3 f(yi, y 2 , V3)). It is logically equivalent, as before, to 

3 F 2 Vxi V x 2 Vx 3 3 y 3 \/f(y u F 2 Xi, y 3 ). 

Now we have the existential quantifiers 3 y\ 3 F 2 at the left; what re¬ 
mains is 


V X! Vx 2 Vx 3 3 y 3 x// (yi , F 2 x x , y 3 ). 

By the same reasoning as before, this is logically equivalent to 

3 F 3 Vx 1 Vx 2 Vx 3 xjr (yi, F 2 xi, F 3 x\ x 2 x 3 ), 

where F 3 is a three-place function variable. Thus the original formula 
is equivalent to 

3yi 3F 2 3F 3 Vxi Vx 2 Vx 3 \jr(yu F 2 x x , F 3 x\x 2 x 3 ). 

For quantifier-free 1 //, this is in the form we desire. 

Skolem Normal Form Theorem For any first-order formula, we can 
find a logically equivalent second-order formula consisting of: 
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(a) First a string (possibly empty) of existential individual and 
function quantifiers, followed by 

(b) A string (possibly empty) of universal individual quantifiers, 
followed by 

(c) A quantifier-free formula. 

A formal proof could be given using induction, but the preceding 
example illustrates the general method. 

Recall that a universal (Vi) formula is a first-order prenex formula 
all of whose quantifiers are universal: Vxi Vx 2 • • • Vx*a, where a is 
quantifier-free. Similarly, an existential (3j) formula is a first-order 
prenex formula all of whose quantifiers are existential. 

Corollary 42A For any first-order cp , we can find a universal for¬ 
mula 0 in an expanded language containing function symbols, 
such that <p is satisfiable iff 6 is satisfiable. 

By applying this corollary to -i <p, we obtain an existential formula 
(with function symbols) that is valid iff cp is valid. 

Proof. Again we will only illustrate the situation by an example. 
Say that cp is 

3 y i V xi 3 y 2 V x 2 V x 3 3 y 3 f Oh. yi > 3b) • 

First we replace <p by the logically equivalent formula in Skolem 
form: 

3 yi 3F 2 3F 3 Vxi Vx 2 Vx 3 \j/ (vi, F 2 x\, F 3 xix 2 x 3 ). 

Then for 0 we take 

Vxi Vx 2 Vx 3 1 [r(c, fx i, gxix 2 x 3 ), 

where c, /, and g are new function symbols having zero, one, and 
three places, respectively. In general 6 is not logically equivalent 
to i p. But we do have 0 (= (p (in the expanded language). And 
any model 21 of <p can be expanded (by defining c 21 , / a , and 
g 21 correctly) to be a model of 0. Thus <p and 0 are “equally 
satisfiable.” H 

This result reduces the general problem of testing first-order formulas 
for satisfiability to the special case of universal formulas (with function 
symbols). And by the same token, it reduces the problem of testing 
for validity to the 3i case. From these reductions we can derive an 
undecidability result for first-order logic: 

Corollary 42 B Consider a recursively numbered language having 
a two-place predicate symbol and infinitely many A'-place function 
symbols for each k > 0. 
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(a) The set of Godel numbers of satisfiable universal (first-order) 
sentences is not recursive. 

(b) The set of Godel numbers of valid existential (first-order) 
sentences is not recursive. 

Proof, (b) Given any sentence a we can, by applying Corollary 42A 
to —i a, effectively find an existential sentence that is valid iff a 
is valid. Hence a decision procedure for the existential validities 
would yield a decision procedure for arbitrary validities, in con¬ 
tradiction to Church’s theorem. H 

We can use predicate variables instead of function variables in these 
results, but at a price. Suppose we begin with a first-order formula. 
It is equivalent to a formula i/r in Skolem normal form; suppose for 
simplicity that i (r — 3 F tp, where tp has only individual quantifiers and 
F is a one-place function variable. We can choose ip in such a way that 
F occurs only in equations of the form u = Ft (for terms t and u not 
containing F). This can be done by replacing, for example, an atomic 
formula a(Ft) by either x{x — Ft —»• a(x )) or 3 x(x — Ft A a(x)). 
Next observe that a formula 

3 F _ u — Ft _, 

wherein F occurs only in the form shown, is equivalent to 
3X(Vy 3 IzXyz A _Xtu_). 

If one pursues this question (as we will not do here) one finds that 
any first-order formula is logically equivalent to a second-order formula 
consisting of 

(a) A string of existential predicate quantifiers, followed by 

(b) A string of universal individual quantifiers, followed by 

(c) A string of existential individual quantifiers, followed by 

(d) A quantifier-free formula. 

There are corresponding versions — see Exercise 4 — of Corollar¬ 
ies 42A and 42B. The analogue of Corollary 42A reduces the problem 
of testing first-order formulas for satisfiability to the special case of V 2 
formulas (with predicate symbols). The problem of testing for validity 
is reduced to the 32 case. 

The analogue of Corollary 42B can be compared with Exercise 10 
in Section 2.6, where it is shown that the set of \A validities without 
function symbols is decidable. 

Herbrand Expansions 

We have seen (in Corollary 42A) how, given a formula of first-order 
logic, to find an “equally satisfiable” universal formula. And thus 
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(Corollary 42B) the question of satisfiability in first-order logic is 
reducible to the question of satisfiability of universal formulas. 

Now we go one step further: satisfiability of these universal formulas 
is reducible — but in a weaker sense — to satisfiability in sentential 
logic. 

Example. We know that V x 3 y Pxy 3 v V .r Pxy . But pretend 
we did not know this, and that we are interested in determin¬ 
ing whether or not logic implication holds here. That is equiv¬ 
alent to determining whether or not the hypothesis V.v 3 yPxy 
together with the negation of the conclusion —i 3 y V x Pxy is 
unsatisfiable. 

By the Skolem normal form theorem, we can replace these sen¬ 
tences by certain logically equivalent sentences; we wish to deter¬ 
minewhether or not 3 F\/x PxFx together with 3 G V y -i PGyy 
is unsatisfiable. And as in Corollary 42A, we replace these sen¬ 
tences by equally satisfiable universal sentences; we wish to de¬ 
termine whether or not the set {V.r Pxfx, V y Pgy v} is unsat¬ 
isfiable (where / and g are new function symbols). 

But this set of universal sentences is satisfiable, and moreover, 
the set can be made to generate its own model. Here is how. For 
the universe of our model we will use the Herbrand universe H, 
which is the set of all terms (in the language with / and g). Thus 
H contains, for each variable u, the terms 


M, fu, gu, ffu, fgu,.... 

Let A be the set of all instances of the universal sentences, that 
is, the formulas obtained by dropping all the universal quantifiers 
and plugging in (for the universally quantified variables) arbitrary 
terms from the Herbrand universe. Thus A contains, for each 
variable u, the quantifier-free formulas 

Pufu, Pgufgu ,..., —i Pguu, —iPgfufu,.... 

Now we consider A from the point of view of sentential logic. 
The sentence symbols are the atomic formulas, e.g., Pgfufu. 
And in this example A is satisfiable in sentential logic. That is, 
there is a truth assignment v on the set of sentence symbols so 
that v(a) = T for every a in A. Here is one such i>: 

Jr ifq is shorter than tj 
V l]t2 | f if t\ is at least as long as t 2 

Finally, we use this truth assignment v (in sentential logic) to 
make a structure Sj (in first-order logic) that will be a model of 
the universal sentences. The universe is the Herbrand universe: 
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\S)\ — II. (There are echos of the completeness proof of Sec¬ 
tion 2.5 here.) The function symbols are interpreted autony- 
mously — as naming themselves: f®(t) is ft and g^(t) is gt. 
Where v comes in is to interpret the predicate symbol P: 

(th,h)eP <=> v(Pt x t 2 ) = T 

This structure works. First VxPxfx because for every 
term t in the Herbrand universe, (t, ft) e P ”. Secondly 
Ntj V y —i Pgy y because for every term t in the Herbrand uni¬ 
verse, (gt, t) f P®. 

We conclude that the hypothesis V x 3 y Pxy together with the 
negated conclusion -i 3 y V x Pxy is indeed satisfiable, and hence 
V x 3 v Pxy 3 y V x Pxy. 

To what extent can we generalize from this example? Assume, for 
simplicity, that the language does not contain equality. (Exercise 7 indi¬ 
cates the changes needed to accommodate equality.) Suppose we want 
to determine whether or not T |= <p for a set I’: <p of formulas in first- 
order logic. This is equivalent to determining whether or not the set 
T; -up is unsatisfiable. 

We can replace each of the formulas here by a logically equivalent 
formula in Skolem normal form. And as in Corollary 42A, we get an 
equally satisfiable set T* of universal formulas. (In applying Skolem 
normal form, we use different Skolem function symbols for each for¬ 
mula, so that there is no clash between formulas.) This brings us to the 
situation: 


r 1= <p •<=>■ is unsatisfiable 

and is a set of universal formulas. 

Let H be the Herbrand universe, that is, the set of all terms in the 
language of T. Let A be the set of all instances of the universal formulas 
in T (i.e., formulas obtained by dropping all the universal quantifiers and 
plugging in for the universally quantified variables arbitrary terms from 
the Herbrand universe). Then A consists of quantifier-free formulas 
only. We consider A from the viewpoint of sentential logic, where the 
sentence symbols are the atomic formulas. 

Case I: A is unsatisfiable in sentential logic. In this case, we can 
conclude that T' is unsatisfiable and T |= tp in first-order logic. This 
is because a universal formula logically implies all of its instances. 
Therefore |= S for every 8 in A (in first-order logic). Any model of T 
must be a model of A. But from a model 21 of A we can extract a truth 
assignment v that satisfies A in sentential logic. (Remember Exercise 3 
in Section 2.4. Note the interesting interplay between first-order logic 
and sentential logic.) 
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Case II: A is satisfiable in sentential logic, say by the truth assign¬ 
ment v. Then we will use v to make a structure Sj in which T is satisfiable 
and T \/= cp because f) will give a counterexample. 

As in the example, the universe |f)| is the Herbrand universe //, the 
set of all terms in the language of And again the function symbols 
are interpreted autonymously: f^ft\ ,..., t n ) — ft] •••?„. To interpret 
a predicate symbol P we use the truth assignment v: 

ih -- r„> e P 55 v(Pti ■■■t n ) = T 

Then we claim that every formula in T is satisfied in f) by the identity 
function = x on the variables. First, note that sft) — t for any term 
t in H ; we had the same situation in step 4 of the completeness proof 
in Section 2.5. Secondly, note that for an atomic formula Pt\ •••/„, 

l=S 5 Ph---t n [s] {t \,..., t n ) e P 5 * <^=A v(Pt\ ■ ■ ■ t„) — T. 

By Exercise 3 of Section 2.4 again, any formula S in A is satisfied in Sj 
with s (because tT(S) = T). 

Consider any formula in T. It is a universal formula; to simplify the 
notation, say it is V V\ V tb 9(v\, V 2 , vj), where 6 is quantifier-free. We 
need to check for any terms t\ and f? in PI that \=% 0\\t \, t 2 , [|. This 

is equivalent (by the substitution lemma) to saying that the formula 
0(t\. h, vj) is satisfied in f) by .v. But this formula is an instance of 
V V \ V V 2 9(v i, ib, W 3 ), and so Oft 1 , t 2 , Vj,) is in A. As noted above, our 
construction was arranged so that every formula in A is satisfied in Sj 
with s. This is what we needed. 

We can summarize the result as follows. For simplicity, the result is 
stated only for sentences. 

Herbrand's Theorem Consider a set F; <p of sentences in a first- 
order language without equality. Let A be as above. Then either 
(case I) A is unsatisfiable in sentential logic and T (= (p, or (case II) 
A is satisfiable in sentential logic and the structure S) constructed 
above is a model of T in which <p is false. 

(Herbrand’s work was in his 1930 dissertation, completed not long 
before he was killed in a mountain-climbing accident. His statement of 
the theorem was quite different from this one, but the ideas follow his 
work, and 1928 work of Thoralf Skolem.) 

In case I, by the compactness theorem of sentential logic, some finite 
subset of A is unsatisfiable. This fact can be used to obtain a proof of 
the compactness theorem for first-order logic, not relying on Section 
2.5 or the deductive calculus of Section 2.4. 

Moreover, a proof of the enumerability theorem, similarly indepen¬ 
dent of Sections 2.4 and 2.5, can be extracted from the Herbrand ap¬ 
proach. Take the special case T = 0. If <p is valid then as we produce 
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more and more of A, at some point we will have an unsatisfiable set, 
a fact we can recognize by use of truth-tables. If tp is not valid, then 
as we produce more and more of A, we are making a structure in 
which <p fails, but the structure is infinite and the construction never 
terminates. 


Exercises 

1. Prove the Lowenheim-Skolem theorem in the following improved 
form: Let 21 be a structure for a countable language. Let S be a 
countable subset of |2t|. Then there is a countable substructure 23 of 
21 with S c 1231 with the property that for any function s mapping 
the variables into 1231 and any (first-order) cp. 

Hi <?[*] iff H <£>[s]. 

Suggestion: Choose Skolem functions for all formulas. Close S under 
the functions. Remark: A substructure 23 with this property is said 
to be an elementary substructure. Note that the property implies (by 
taking <p to be a sentence) that 21 = 23. On the one hand, this form 
gives a stronger conclusion than we had in Section 2.6. Not only 
do we get that Th 21 has some countable model, we get a countable 
^nfomodel. On the other hand, the proof uses the axiom of choice, 

2. Extend the previous exercise to the uncountable case. Assume that 21 
is a structure for a language of cardinality X. Let S be a subset of |2l| 
having cardinality k. Show that there is an elementary substructure 
23 of 21 of cardinality at most k + X with S C 1231. 

3. Show that Corollary 42B is optimal in the following sense: 

(a) Given any 3 \ sentence a we can effectively decide whether or 
not a is satisfiable. 

(b) Given any V i sentence a we can effectively decide whether or 
not a is valid. 

4. (a) State the two corollaries (analogous to 42A and 42B) described 

at the end of this section. 

(b) Supply proofs. 

5. Repeat the example given for Herbrand expansions, but for the con¬ 
verse: 3 yVxPxy |= Vx 3 yPxy. Show that in this case the set A 
is unsatisfiable in sentential logic. 

6. Apply the method of Herbrand expansions to establish the following: 
|= 3 x(Px —y VxPx). 

7. Modify the Herbrand expansion construction to accomodate a lan¬ 
guage with equality. Suggestion: In effect, step 5 of the completeness 
proof in Section 2.5 must be added. Add enough universal sentences 
to assure that {(fi, ? 2 ) I u(fi = ti) — T is a congruence relation. 
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SECTION 4.3 _ 

Many-Sorted Logic 

We now return to first-order languages, but with many sorts of variables, 
ranging over different universes. (In the next section this will be applied 
to the case in which one sort of variable is for elements of a universe, 
another for subsets of that universe, yet another for binary relations, and 
so forth.) 

In informal mathematics one sometimes says things like, “We use 
Greek letters for ordinals, capital script letters for sets of integers,....” 
In effect, one thereby adopts several sorts of variables, each sort having 
its own universe. We now undertake to examine this situation precisely. 
As might be expected, nothing is drastically different from the usual 
one-sorted situation. None of the results of this section are at all deep, 
and most of the proofs are omitted. 

Assume that we have a nonempty set /, whose members are called 
sorts, and symbols arranged as follows: 

A. Logical symbols 

0. Parentheses: (,). 

1. Sentential connective symbols: -i, —►. 

2. Variables: For each sort i, there are variables v\ , v l 2 , ... of 
sort i. 

3. Equality symbols: For some i e I there may be the symbol 
=,-, said to be a predicate symbol of sort (/, i). 

B. Parameters 

0. Quantifier symbols: For each sort i there is a universal quan¬ 
tifier symbol V,. 

1. Predicate symbols: For each n > 0 and each n-tuple (ii,..., i n ) 
of sorts, there is a set (possibly empty) of n -place predicate 
symbols, each of which is said to be of sort {i\, ..., i n ). 

2. Constant symbols: For each sort i there is a set (possibly 
empty) of constant symbols each of which is said to be of 
sort i. 

3. Function symbols: For each n > 0 and each (n + 1)-tuple 
{ii,..., i„, i„+i) of sorts, there is a set (possibly empty) of 
n -place function symbols, each of which is said to be of sort 

{1 1, • • ■ 5 In i in+l } • 

As usual, we must assume that these categories of symbols are 
disjoint, and further that no symbol is a finite sequence of other 
symbols. 

Each term will be assigned a unique sort. We define the set of terms 
of sort i inductively, simultaneously for all i: 
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1. Any variable of sort i or constant symbol of sort i is a term of 
sort i. 

2. If ti,, t„ are terms of sort i\,... ,i n , respectively, and / is a 
function symbol of sort (i\,..., i n , ;„+i > then ft\ ■ ■ ■ t n is a term of sort 
b;+l • 


This definition can be recast into a more familiar form. The set of 
pairs (t, i) such that t is a term of sort i is built up from (i.e., generated 
from) the basic set 

{{v ' n , i) | n > 1 & i e /} U {(c, i) \ c is a constant symbol of sort i } 

by the operations that, for a function symbol / of sort (i i,..., i n , i n+ \) , 
produce the pair (ft\ ■ ■ ■ t n , i n +i) from the pairs {t\. ij ),... ,{t„, i n ). 

An atomic formula is a sequence Pt\ ■ ■ ■ t„ consisting of a predicate 
symbol of sort {i\,... ,i n ) and terms t\,... ,t n of sortrespec¬ 
tively. The nonatomic formulas are then formed using the connectives 
-i, —*• and the quantifiers V, v] r 

A many-sorted structure 21 is a function on the set of parameters 
which assigns to each the correct type of object: 

1. To the quantifier symbol V,-, 21 assigns a nonempty set |2l|; called 
the universe of 21 of sort i. 

2. To each predicate symbol P of sort (i \,..., ;„), 21 assigns a 
relation 


c 1211,-, x ••• x |2l|,„. 

3. To each constant symbol c of sort i, 21 assigns a point c 21 in |2l|,-. 

4. To each function symbol / of sort (i\,..., i„, i„+i). 21 assigns a 
function 


/ a : 1211;, x ••• x |2l|,„ -► |2t| in+1 . 

The definitions of truth and satisfaction are the obvious ones, given 
that V, is to mean “for all members of the universe |2l|; of sort i.” 

In a many-sorted structure, the universes of the various sorts might or 
might not be disjoint. But since we have no equality symbols between 
sorts, any nondisjointness must be regarded as accidental. In partic¬ 
ular, there will always be an elementarily equivalent structure whose 
universes are disjoint. 

Reduction to One-Sorted Logic 

Many-sorted languages may at times be convenient (as in the following 
section). But there is nothing essential that can be done with them that 
cannot already be done without them. We now proceed to make this 
assertion in a more precise form. 
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We will consider a one-sorted language having all the predicate, 
constant, and function symbols of our assumed many-sorted language. 
In addition, it will have a one-place predicate symbol Q, for each i in I. 
There is a syntactical translation taking each many-sorted formula <p 
into a one-sorted formula (p *. In this translation all equality symbols are 
replaced by =. The only other change is in the quantifiers (the quantifier 
symbols and the quantified variables): We replace 

v, 

by 

V v(QiV —»■ v_), 

where v is a variable chosen not to conflict with the others. Thus the 
quantifiers of sort i are "relativized” to Q,. (The free variables are left 
alone.) 

Turning now to semantics, we can convert a many-sorted structure 
21 into a structure 21* for the above one-sorted language. The universe 
121*| is the union (J. g/ |2l|, of all the universes of 21. To Q l is assigned 
the set 1211, . On the predicate and constant symbols, 21* agrees with 21. 
For a function symbol /, the function f"' 1 is an arbitrary extension of 
/ a . (Of course this last sentence does not completely specify f' 21 . The 
results we give for 21* hold for any structure obtained in the manner just 
described.) 

Lemma 43A A many-sorted sentence a is true in 21 iff a * is true 
in 21*. 

To prove this, one makes a stronger statement concerning formulas: 

l=a *>[*] <=► ha* <P*\s] 

where s(v' n ) e |21|,. The stronger statement is then proved by induction. 

Consider now the other direction. A one-sorted structure is not al¬ 
ways convertible into a many-sorted structure. So we will impose some 
conditions. Let <l> be the set consisting of the following one-sorted 
sentences: 

1. 3 vQjV, for each i in I. 

2. V vi ■ ■ ■ V v n {QiV i —>•-> Qi„v, , —> Q in+l fV] ■ ■ ■ v n ), for each 

function symbol / of sort (i i,..., i„, i„+\). We include the case n — 0, 
in which case the above becomes the sentence QjC for a constant symbol 
c of sort i. 

Notice that the above 21* was a model of O. A one-sorted model IB 
of 4 1 does convert into a many-sorted 23®. The conversion is performed 
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in the natural way: 

l® # l i = Q?\ 

= p® n (<2® x • • • x (Qf), where P is a predicate symbol 
of sort (i i,..., i n ) 
c® # = c®; 

/® B = /® n (2® x • • • x 2® x <2® +1 ), the restriction of 

/® to 2® x • • • x g ( ®, where / is a function symbol 
of Sort (/ 1 , . . . , i n • i n | ] ) - 

Lemma 43B If 23 is a model of O, then 23* is a many-sorted struc¬ 
ture. Furthermore, a many-sorted sentence a is true in 23 : iff er* 
is true in 23. 

The proof is similar to the proof of Lemma 43A. 

Notice that 23** is not in general equal to 23. (For example, 1231 may 
contain points not belonging to any Qf.) On the other hand, 21* : is equal 
to 21. 

Theorem 43C In the many-sorted language 

E |= a 

iff in the one-sorted language 

E*UO |=cr*. 

Proof. (=?) Assume that E \= a and let 23 be a one-sorted model 
of E* U 4> (where E* = {a* | a e E}). Then 23 # is a model of 
E by Lemma 43B. Hence 23 : is a model of a. So by Lemma 43B 
again, (8 is a model of a*. 

(<=) Similar, with Lemma 43A. H 

By using Theorem 43C, we can now infer the following three theo¬ 
rems from the corresponding one-sorted results. 

Compactness Theorem If every finite subset of a set E of many- 
sorted sentences has a model, then E has a model. 

Proof. Assume that every finite subset Eo of E has a many-sorted 
model 21 q. Then a finite subset Eq of E* has the model 21 q. Hence 
by the ordinary compactness theorem, E* has a model 23. 23* is 
then a model of E. H 

Enumerability Theorem For a recursively numbered many-sorted 
language, the set of Godel numbers of valid sentences is recur¬ 
sively enumerable. 

Proof. For a many-sorted a, we have by Theorem 43C, 

|= O iff <J> |= O'*. 
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Since $ is recursive, Cn<J> is recursively enumerable. And a* 
depends recursively on a, so we can apply Exercise 7(b) of Sec¬ 
tion 3.5. H 

Lowenheim-Skolem Theorem For any many-sorted structure (for a 
countable language) there is an elementarily equivalent countable 
structure. 

Proof. Say that the given structure is 21. Then 21* is a one-sorted 
model of (Th 21) * U <t>. Hence by the ordinary Lowenheim-Skolem 
theorem, (Th2l)* U <J> has a countable model 23. 03® is a model of 
Th 21 and so is elementarily equivalent to 21. H 


SECTION 4.4 _ 

General Structures 

We now return to the discussion of second-order logic begun in Sec¬ 
tion 4.1. We discussed there (a) the syntax, i.e,, the set of wffs for second 
order, and (b) the semantics, i.e,, the concept of structure (which was 
the same as for first order) and the definition of satisfaction and truth. 

In this section we want to leave (a) unchanged, but we want to present 
an alternative to (b). The idea can be stated very briefly: We view the 
language (previously thought of as second-order) now as being a many- 
sorted elementary (i.e., first-order) language. The result is to make open 
to interpretation not only the universe over which individual variables 
range but also the universes for the predicate and function variables. This 
approach is particularly suited to number theory; that case is examined 
briefly at the end of this section. 

The Many-Sorted Language 

Despite the fact that we want ultimately to consider the grammar of Sec¬ 
tion 4.1, it will be expedient to consider also a many-sorted (first-order) 
language constructed from the second-order language of Section 4.1. 
We take No sorts: the one individual sort (with variables V \ , i>2, ■ • .); for 
each n > 0, the 77 -place predicate sort (with variables X" , X'f ...); and 
for each n > 0, the n-place function sort (with variables F ", T". ...). 
We will use equality (=) only between terms of the individual sort. 
The predicate and function parameters of our assumed second-order 
language will also be parameters of the many-sorted language, and will 
take as arguments terms of the individual sort. (For a function parameter 
/, the term ft is of the individual sort. The only terms of predicate or 
function sort are the variables of those sorts.) 

In addition, we now use two new classes of parameters. For each 
n > 0 there is a membership predicate parameter e n which takes as 
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arguments one term of the n-place predicate sort (i.e., a variable X" n ) 
and n terms of the individual sort. Thus, for example, 

SiX 3 v 2 v x v & 

is a wff. Its intended interpretation is that the triple denoted by ( v 2 , V] , v%) 
is to belong to the relation denoted by X 3 . This is exactly the interpre¬ 
tation assigned previously to the second-order formula 

X i v 2 v l v 8 , 

and, in fact, readers are advised to identify these two formulas closely 
in their minds. 

For each n > 0, there is also the evaluation function parameter E n . 
E„ takes as arguments one term of the n-place function sort (i.e., a 
variable F" m ) and n terms of the individual sort. The resulting term, 

E„F"ti ■■■?„, 

is itself of the individual sort. Again readers are advised to identify 
closely the term E n F n t\ ■ ■ ■ t n with the previous F n t\ ■ ■ ■ t n . 

There is an obvious way of translating between the second-order 
language of Section 4.1 and the present many-sorted language. In one 
direction we stick on the e n and E n symbols; in the other direction we 
take them off. The purpose of these symbols is to make the language 
conform to Section 4.3. 

A many-sorted structure has universes for each sort and assigns suit¬ 
able objects to the various parameters (as described in the preceding 
section). First, we want to show that without loss of generality, we may 
suppose that s n is interpreted as genuine membership and E n as genuine 
evaluation. 

Theorem 44A Let 21 be a structure for the above many-sorted lan¬ 
guage such that the different universes of 21 are disjoint. Then 
there is a homomorphism h of 21 onto a structure 23 such that 

(a) h is one-to-one, in fact the identity, on the individual uni¬ 
verse (from which it follows that 

1 = 2 , q>[s] iff |=<b (p[h o s] 

for each formula q>). 

(b) The n-place predicate universe of 23 consists of certain n- 
ary relations over the individual universe, and (R, a \,..., a n ) is 
in e® iff {a \,..., a n ) e R. 

(c) The n -place function universe of 18 consists of certain 

n-place functions on the individual universe, and £ * (/, ci\,, 
a n ) = f(ai -- a n ). 

Proof. Since the universes of 21 are disjoint, we can define h on one 
universe at a time. On the individual universe U, h is the identity. 
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On the universe of the n-place predicate sort, 

h(Q) = {(«!,..., a n ) | each a,- is in U and 
( Q , a i, ..., «„} is in sf}. 

Thus 

(fli,..., a„) e h(Q) iff {Q, au ..., a n ) is in e*. (1) 

Similarly, on the universe of the n -place function sort, 

h(g) is the n-place function on U whose value at 
(a i, ■ ■ .,a„) is Ef(g, a x -- a n ). 

Thus 

h(g)(a i,..., a„) = E*(g, a\ -- a n ). (2) 

For e® we take simply the membership relation, 

(R, a \,..., a n ) is in e® iff (a \,..., a n ) e R. (3) 

For £® we take the evaluation function, 

£®(/,«!, = /(ai, ...,a n ). (4) 

On the other parameters (inherited from the second-order lan¬ 
guage) IB agrees with 21. 

Then it is clear upon reflection that h is a homomorphism of 
21 onto 23. That It preserves s„ follows from (1) and (3), where in 
(3) we take R — h(Q). Similarly, from (2) and (4) it follows that 
h preserves E„. 

Finally, we have to verify the parenthetical remark of part (a). 
This follows from the many-sorted analogue of the homomor¬ 
phism theorem of Section 2.2, by using the fact that we have 
equality only for the individual sort, where h is one-to-one. H 

By the above theorem, we can restrict attention to structures 23 in 
which e„ and /:„ are fixed by (b) and (c) of the theorem. But since e® 
and Zs® are determined by the rest of IB, we really do not need them 
at all. When we discard them, we have a general pre-structure for our 
second-order grammar. 

General Structures for Second-Order Languages 

These structures provide the alternative semantics mentioned at the be¬ 
ginning of this section. 

Definition. A general pre-structure 21 for our second-order lan¬ 
guage consists of a structure (in the original sense), together with 
the additional sets: 

(a) For each n > 0, an n-place relation universe, which is a 
set of n-ary relations on |2t|; 
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(b) For each n > 0, an «-place function universe, which is a 
set of functions from |2l|" into |2l|. 

21 is a general structure if, in addition, all comprehension sen¬ 
tences are true in 21. 

The last sentence of the definition requires explanation. First, a com¬ 
prehension sentence is a sentence obtained as a generalization of a 
comprehension formula (see Example 3 of Section 4.1). Thus it is a 
generalization of 

3 X n V Vi ■ ■ ■ V v n ( X" V\ ■ ■ ■ v n «-► (p), 
where X" does not occur free in <p, or a generalization of 

Vt) r "Vn,3!»„ +1 ^ 

3 F n V vi ■ ■ ■ V v n+ i (F n vi---v n = Vn+\ <-► f), 

where F" does not occur free in i jr. (Here <p and i// can have individual 
variables, predicate variables, and function variables.) 

Next we must say what it means for a comprehension sentence (or 
any second-order sentence for that matter) to be true in 21. Assume then 
that 21 is a general pre-structure. Then a sentence a is true in 21 iff the 
result of converting a into a many-sorted sentence (by adding e„ and 
E„ ) is true in 21, with e„ interpreted as membership and E n as evaluation. 

More generally, let <p be a second-order formula, and let s be a func¬ 
tion that assigns to each individual variable a member of |2l|, to each 
predicate variable a member of the relation universe of 21, and to each 
function variable a member of the function universe of 21. Then we say 
that 21 satisfies <p with s (written v?[s]) iff the many-sorted version 
of (p is satisfied with s in the structure 21, where e n is interpreted as 
membership and E n as evaluation. 

The essential consequences of this definition of satisfaction are the 
following, which should be compared with 5 and 6 of page 283. 

|=§ V X n </>[.?] iff for every R in the n-place relation 
universe of 21, (p[s(X n \ R)]. 

|=§ V F" <p[s ] iff for every / in the n-place relation 
universe of 21, q>[s(F" \ /)]. 

This, then, is the alternative approach mentioned at the beginning 
of the section. It involves treating the second-order grammar as being 
a many-sorted first-order grammar in disguise. Because this approach 
is basically first-order, we have the Lowenheim-Skolem theorem, the 
compactness theorem, and the enumerability theorem. 

Lowenheim-Skolem Theorem If the set E of sentences in a count¬ 
able second-order language has a general model, then it has a 
countable general model. 
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Here a countable general model is one in which every universe is 
countable (or equivalently, the union of all the universes is countable). 

Proof. Let T be the set of comprehension sentences. Then EUT, 
viewed as a set of many-sorted sentences, has a countable many- 
sorted model by the Lowenheim-Skolem theorem of the pre¬ 
ceding section. By Theorem 44A, a homomorphic image of that 
model is a general pre-structure satisfying EUT, and hence is a 
general model of E. H 

Compactness Theorem If every finite subset of a set E of second- 
order sentences has a general model, then E has a general model. 

Proof. The proof is exactly as above. Every finite subset of E U F 
has a many-sorted model, so we can apply the compactness the¬ 
orem of the preceding section. H 

Enumerability Theorem Assume that the language is recursively 
numbered. Then the set of Godel numbers of second-order 
sentences that are true in every general structure is recursively 
enumerable. 

Proof. A sentence a is true in every general structure iff it is a 
many-sorted consequence of T. And 'T is recursive, H 

The above two theorems assure us that there is an acceptable deduc¬ 
tive calculus such that r is deducible from E iff r is true in every general 
model of E (see the remarks at the beginning of Section 2.4). But now 
that we know there is such a complete deductive calculus, there is no 
compelling reason to go into the detailed development of one. 

We can compare the two approaches to second-order semantics as 
follows: The version of Section 4.1 (which we will now call absolute 
second-order logic) is a hybrid creature, in which the meaning of the 
parameters is left open to interpretation by structures, but the concept of 
being (for example) a subset is not left open, but is treated as having a 
fixed meaning. The version of the present section (general second-order 
logic) avoids appealing to a fixed notion of subset, and consequently is 
reducible to first-order logic. In this respect it is like axiomatic set theory, 
where one speaks of sets and sets of sets and so forth, but the theory is 
a first-order theory. 

By enlarging the class of structures, general second-order logic di¬ 
minishes the cases in which logical implication holds. That is, if every 
general model of E is a general model of er, it then follows that E \= a 
in absolute second-order logic. But the converse fails. For example, 
take E = 0: The set of sentences true in all general models is a recur¬ 
sively enumerable subset of the nonarithmetical set of valid sentences 
of absolute second-order logic. 



304 


A Mathematical Introduction to Logic 


Models of Analysis 

We can illustrate the ideas of this section by focusing attention on the 
most interesting special case: general models of second-order number 
theory. Consider then the second-order language for number theory, with 
the parameters 0, S, <, •, and E. We take as our set of axioms the set A\ 
obtained from A E by adding as a twelfth member the Peano induction 
postulate (Example 2, Section 4.1). From Exercise 1 of Section 4.1, we 
can conclude that any model (in the semantics of that section) of A\ is 
isomorphic to 91. 

But what of the general models of our axiom set? They can differ 
from 91 in either (or both) of two ways. We can employ the compact¬ 
ness theorem as before to construct (nonstandard) general models of the 
axioms having infinite numbers (i.e., models 21 with an element larger 
than the denotation of S"0 in the ordering < a ). We can also find (non- 
absolute) general models in which, for example, the set universe (the 
unary relation universe) is less than the full power set of the individual 
universe. Indeed, any countable general model must be of this kind. 

It is traditional for logicians to refer to second-order number theory 
as analysis. The name derives from the fact that it is possible to identify 
real numbers with sets of natural numbers. In second-order number 
theory we have quantifiers over sets of natural numbers, which we can 
view as quantifiers over real numbers. The appropriateness of the name 
is nevertheless open to question, but its usage is well established. By 
a model of analysis we will mean a general model of the above axiom 
set A 2 e . 

Define an co-model of analysis to be a model of analysis in which 
the individual universe is N and the denotations of 0 and S are the 
genuine 0 and S. (Consequently, the denotations of <, +, •, and E are 
also standard.) The motivation for studying w-models can be stated as 
follows: We have a clear understanding — or so we think — of the set N. 
But we do not have anything like the same understanding of its power 
set PN. For example, we may be uncertain whether its cardinality is K i 
or tC or more. So it is reasonable to hold fixed that which we are sure 
of (N), but to leave open to interpretation by a structure that which we 
are not sure of {VN). 

Among the co-models of analysis there is the one absolute model, 
whose n-place relation universe consists of all n-ary relations on N (and 
whose function universes consist of all possible functions). A first-order 
sentence is true in an arbitrary co-model of analysis iff it is true in 91. 
But the co-model may disagree with the absolute model on second-order 
sentences. 

In the next theorem we assert that an co-model of analysis is com¬ 
pletely determined by its set universe (i.e., its one-place relation uni¬ 
verse). 
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Theorem 44B If 21 and $ are w-models of analysis having the same 
one-place relation universe, then 21 = 23. 

Proof. Suppose R belongs to the three-place relation universe of 
21. Let (R) be the “compression” of R into a unary relation: 

(R) = {(a, b , c) | (a, b, c) e R}. 

Our sequence-encoding function is recursive and hence is first- 
order definable in number theory by a formula q>. (R) is in the set 
universe of 21 by virtue of the comprehension sentence 

VX 3 3TC V u[X l u •*-» 3 Uj 3 in 3 v^((p(v V 2 , v 3 , u) 
/\x 3 v ] v 1 v- i y\. 

Thus (R) is in the set universe of 23; we unpack it by a similar 
argument. R is in the three-place relation universe of 23 by virtue 
of the comprehension sentence 

VX'BA 3 V V\ V U 2 V V3[X 3 viV2V} > 

3 u(<p(V], V 2 , U 3 , u) A X ] u)]. 

A similar argument applies to the function universes. H 

Consequently, we can identify an oi-model of analysis with its set 
universe (which is included in PN). Not every subclass of PN is then 
an w-model of analysis, but only those for which the comprehension 
sentences are satisfied. 

Examples of cj-models. We need only specify the set universe. 

1 . VN is the absolute model. 

2. Let (A; € 4 ) be a model of the usual axioms for set theory 
such that (i) the relation e,i is the genuine membership relation 
{(a, b) | a e A, b e A, and a e b} on the universe A, and 
(ii) A is transitive, i.e., if a e b e A, then a e A. Then the 
collection of all those subsets of N that belong to A is an w-modeI 
of analysis. 

3. For a class A c "PN, define IL4 to be the class of all sets 
B C N which are definable in the cd pre-structure with set uni¬ 
verse A by a formula of the language of second-order number 
theory, augmented by parameters for each set in A. Then define 
by transfinite recursion on the ordinals: 

Ao = 0, 

A a +1 — OA a , 

Ax = U ff <x A for limit k. 

By cardinality considerations we see that this stops growing 
at some ordinal /l for which Ap+\ — Ap. Let ySo be the least 
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such fi ; it can be shown (from the Lowenheim-Skolem theorem) 
that /3o is a countable ordinal. Ap 0 coincides with A a (the 
union being over all ordinals a) and is called the class of rami¬ 
fied analytical sets. It is an ru-model of analysis; the truth of the 
comprehension sentences follows from the fact that V>Ap (j C Ap 0 . 
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Lagrange’s theorem, 166 
Languages 

many-sorted, 299-301 
of equality, 285 
See also First-order 
languages, Formal 
languages, Second-order 
logic 

Least-zero operator, 216, 
220-221 

Leibniz, G. W. v., 173 
Length, 221 

Lindenbaum’s theorem, 246 
Linear connectives, 52 
Linear transformations, 99 
Literal, 59 
Lob’s theorem, 269 
Logical axioms, 110, 112, 

125 

recursiveness of, 232 
validity of, 131-134 
Logical implication, 88-99 
Logically equivalence, 88 
Logical symbols, 14, 

69-70 

Los-Vaught test, 157-160, 
190 

Lowenheim, Leopold, 151 
Lowenheim-Skolem 

theorem, 103, 151-155, 
190 

in many-sorted logic, 299 
in second-order logic, 285, 
302-303 

LST theorem, 154 
Lukasiewicz, Jan, 33 
See also Polish notation 


M 

Majority connective, 45 
Malcev, Anatolii, 145 
Many-one reducibility, 256 


Many-sorted logic, 295-299 
application to second-order 
logic, 299-301 
Many-valued logic, 20 
Map, 5 

Map coloring, 65, 146 
Material conditional, 21 
Membership predicate, 
299-300 

Meta-language, 89, 129 
Metamathematics, use of 
term, 69 

Metatheorems, 116-120 
Models, 80-99 
of analysis, 304-306 
of theories, 147-162 
Modus ponens, 66, 110-111, 
116 

Monotone connectives, 54 
Monotone recursion, 224 
//.-operator, 216, 220-221 


N 


Nand, 51 

Natural numbers, 2 

See also Number theory 
Negation symbol, 14, 17 
Newton, Isaac, 173 
Nonlogical symbols, 14 
Nonprime formulas, 114 
Nonstandard analysis, 
173-181 

algebraic properties, 
176-178 

construction of hyperreals, 
173-176 

convergence in, 178-180 
Nonstandard models, 
152-153, 183, 304 
Normal form theorem, 
for recursive functions, 
252-253 

Skolem, 288-289 
Notation, 77-79 
NP, 26, 101 
Number theory, 182 
language of, 70, 72, 182 
with addition, 196-197, 
280 

with exponentiation, 
202-205, 280 


with multiplication, 
276-281 

with ordering, 193-196, 
280 

with successor, 187-193, 
280 

Numerals, 183-184, 209 
Numeralwise determined 
formulas, 206, 
210-212 


O 

Object language, 89 
Occur free, 76-77 
tu-completeness, 223 
(//-consistency, 241, 245 
(//-models of analysis, 
304-306 

One-sorted logic, 296-299 
One-to-one functions, 5 
Onto, 5 

Operating system, 253 
Operations, 5 
Ordered n -tuples, 3-4 
Ordered pairs, 3, 4 
Ordering relations, 6, 93, 159, 
284 


P 

Pairing function, 220, 
277-278 

Pairwise disjoint set, 3 
Parameters, 14, 70 
Parameter theorem, 258-260, 
264 

Parentheses, use of, 33, 78 
Parity connective, 53 
Parsing algorithm 
in first-order logic, 
105-108 

in sentential logic, 29-33 
Parsing formulas, 29-33, 
107-108 

Parsing terms, 106-107 
Partial functions, 250 
Partial recursive functions. 
See Recursive functions, 
partial 
Partition, 6 



Index 


315 


Peano arithmetic (PA), 
269-270 

Peano induction postulate, 
193, 284, 286-287 
Periodic set, 201 
Permutation, 100 
Polish notation, 32-33, 74 
Polynomial-time decidable, 
26, 115 

Post, Emil, 47, 152, 261 
Power set, 2-3 
Predicate calculus. See 
First-order logic 
Predicate symbols, 70, 79, 

128 

Predicate variables, 282 
Prenex formulas, 160 
Prenex normal form, 160-161 
Presburger’s theorem, 
197-198 

Prime formulas, 114-115 
Prime implicants, 59 
Prime numbers, 91, 184, 
218-219 

Primitive recursion, 221-222, 
227 

Principia Mathematica 
(Whitehead and 
Russell), 152 
Proof, nature of, 109 
See also Deductive 
calculus 

Propositional logic, 14 
Proposition symbol, 14-15 

Q 

Quantifier capture, 113 
Quantifiers, 70 
bounded, 204,210-211 
elimination of, 190-192 
existential, 287, 288 
Quantifier symbol, universal, 
80 

Quotient structure, 140 


R 

Rabin, Michael, 201 
Ramified analytical sets, 306 
Range (of relation), 4 


Reasonable language, 
142-144 

See also Recursively 
numbered language 
Recursion, 32, 38^-4 
monotone, 224 
primitive, 221-222, 227 
Recursion theorem, 39—40, 
41^12 

Recursive functions, 247-250 
normal form, 248-250 
partial, 250-258, 262 
reduction of decision 
problems, 258-260 
register machines, 261-263 
Recursively axiomatizable 
theories, 233, 240 
Recursively enumerable 
(r.e.) relations, 233, 
238-241 

Recursively inseparable sets, 
245 

Recursively numbered 
language, 225 

Recursive relations, 207-210, 
232 

See also Recursively 
enumerable relations 
Reductio ad absurdum, 119, 
121 

Reducts of number theory, 
182-183, 193-202 
Reflexive relations, 5 
Register machines, 208, 
261-263 

Relation comprehension 
formulas, 284 
Relations, 4-6 
Relation universe, 301 
Relay circuits, 57 
Representable functions, 
212-217 

Representable relations, 
205-206 
and numeralwise 
determined formulas, 
206, 210-212 
weakly, 241-242 
Re-replacement lemma, 130 
Resolution, 53 
Restriction, 5, 221 
Rice’s theorem, 260 
Rigid structure, 98 


Robinson, Abraham, 173 
Root of tree, 7 
Rule El, 124-125, 145 
Rules of inference, 110 
Rule T, 118 


S 

Satisfaction of formulas, 23, 
83-86 

Satisfiable sets, 59-60, 134 
Schema, 286 

Schroder-Bernstein theorem, 
9 

Second-order logic 
absolute, 303 
and many-sorted logic, 
295-299 

general structures, 

299-306 

language of, 282-286 
Skolem functions, 145, 
287-290 

Segments of sequences, 4 
initial, 4 

terminal, 105-106 
Self-reference, 234-235, 315 
approach to 
incompleteness, 

184-186 

Semantics and syntax, 125 
Semidecidable set, 63 
Semidecision procedure, 63 
Sentences, 77, 79 
Sentence symbols, 14, 115 
Sentential connectives, 14, 
45-54 
binary, 51 
ternary, 51-52 
unary, 51 
0-ary, 50 
Sentential logic 
compactness, 24, 59-60 
connectives, 45-52 
language of, 13-19 
parsing algorithm, 29-33 
tautologies, 23 
truth assignments, 20-27 
Sequence encoding and 

decoding, 220, 277, 281 
Sequence number, 221 
Sequences, finite, 4 



316 


Index 


Sets 

concept, 1-2 
countable, 6 
disjoint, 3 

empty versus nonempty, 2 
finite, 6 

intersection of, 3 
ordered, 93 
pairwise disjoint, 3 
power, 2-3 
union of, 3 

Set theory (ST) 152, 157, 
161-162, 240-241, 
270-275 

Godel incompleteness 
theorems for, 274-275 
language of, 70, 71 
Sheffer stroke, 51 
Shepherdson, John C., 261 
Shepherdson-Sturgis 
machines, 261-263 
Simplification of formulas, 59 
Single-valued relations, 5 
Skolem, Thoralf, 145, 151, 
293 

functions, 145, 287-290 
Lowenheim-Skolem 
theorem, 151-155 
normal form, 288-289 
paradox, 152 
S-m-n theorem. See 
Parameter theorem 
Soundness theorem, 66, 131 
Spectrum, 101, 150, 285 
Standard part, 178 
Steinitz’s theorem, 158-159 
Strategy for deductions, 
120-126 
String, 4 

Strong undecidability, 237, 
272-273 
Structures, 80-81 

cardinality of, 153-154 
definability in, 90-92 
definability of a class of, 
92-94 

general, 299-306 
Sturgis, H. E., 261 
Subsets, 2 
Substitutability, 113 
Substitution, 28, 112-114 
and alphabetic variants, 

126 


lemma, 133-134 
of terms, 112, 129 
representability of, 228 
Substructures, 95-96, 294 
Sufficiently strong theory, 
246, 267 

Switching circuits, 54-59 
Symbols 

logical, 14, 69-70 
nonlogical, 14 
parameter, 71 
sentential connective, 14, 
45-54 

Symmetric relations, 5 
Syntactical translation, 
169-172 

Syntax, arithmetization of, 
224-234 

Syntax and semantics, 125 


T 

Tarski, Alfred, 152, 154, 159, 
286 

undefinability theorem, 
236, 240 

Tautological equivalence, 24 
Tautological implication, 23 
Tautologies, 23 

in first-order languages, 
114-116 

representability of, 

230-231 

selected list of, 26-27 
Term-building operations, 74 
Terminal segment, 105-106 
Terms, 74, 79 
parsing, 106-107 
representing, 226-227 
unique readability of, 107 
Ternary connectives, 51-52 
Theorem, concept of, 
110-111,117 
Theories, 155-160 
axiomatizable, 156 
finitely axiomatizable, 156 
interpretations between, 
164-172 

models of, 147-162 
of structures, 148, 152, 155 
Total function, 250 
T -predicate, 249 


Trakhtenbrot’s theorem, 151 
Transitive relations, 5 
Trees, 7 

of deduction, 116-117 
of well-formed formulas, 
17, 22, 75 

Trichotomy, 6, 93, 159, 194 
Truth, 20 

undefinability of, 236, 240 
Truth and models, in 

first-order logic, 80-99 
Truth assignments, 20-27 
Truth tables, 24-26 
Truth values, 20 
Turing, Alan, 208, 261 
Turing machine, 208 
Two-valued logic, 20 
Tychonoff’s theorem, 24 

U 

Ultraproducts, 142 
Unary connectives, 51 
Uncountable languages, 141, 
153-154 
Undecidability 

incompleteness and, 
234-235 

of number theory, 182-187 
of set theory, 272 
strong, 237, 272-273 
Undefinability theorem, 
Tarski, 236, 240 
Union, 3 

Unique existential quantifier, 
102, 165 

Unique readability theorem 
for formulas, 108 
for terms, 107 
in sentential logic, 40—41 
Universal formulas (Vi), 102 
Universal quantifier symbol, 
68, 70, 80 

Universe, of structures, 81 
Unsolvability of halting 
problem, 254 

V 

Valid formula, 88-89 

in second-order logic, 286 



Index 


317 


Variables, 70, 79 
bound, 80 
free, 76-77 
function, 282 
individual, 282-283 
predicate, 282 
Vaught’s test. See 
Los-Vaught test 
Vector spaces, 92, 99 


W 

Weakly representability, 
241-242 

Well-defined function, 165 
Well-formed formulas (wffs), 
12, 17-18, 75 
Well-ordering, 283 


Z 

Z-chains, 189-190, 197 
Zermelo-Fraenkel set theory, 
157,270 

0-ary connectives, 50 
0-place function symbols, 70 
Zorn’s lemma, 7, 60, 141 



